Saturday, September 17, 2022

How we used Topic Modeling to assign topics to Millions jobs (Realtime) and gain insights | Architecture

Topic2vec_try
In [64]:
!pip install top2vec
!pip install top2vec[sentence_encoders]
!pip install top2vec[sentence_transformers]
!pip install top2vec[indexing]
!pip install yake
Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Requirement already satisfied: top2vec in /usr/local/lib/python3.7/dist-packages (1.0.27)
Requirement already satisfied: hdbscan>=0.8.27 in /usr/local/lib/python3.7/dist-packages (from top2vec) (0.8.28)
Requirement already satisfied: pandas in /usr/local/lib/python3.7/dist-packages (from top2vec) (1.3.5)
Requirement already satisfied: numpy>=1.20.0 in /usr/local/lib/python3.7/dist-packages (from top2vec) (1.21.6)
Requirement already satisfied: umap-learn>=0.5.1 in /usr/local/lib/python3.7/dist-packages (from top2vec) (0.5.3)
Requirement already satisfied: gensim>=4.0.0 in /usr/local/lib/python3.7/dist-packages (from top2vec) (4.2.0)
Requirement already satisfied: wordcloud in /usr/local/lib/python3.7/dist-packages (from top2vec) (1.8.2.2)
Requirement already satisfied: scipy>=0.18.1 in /usr/local/lib/python3.7/dist-packages (from gensim>=4.0.0->top2vec) (1.7.3)
Requirement already satisfied: smart-open>=1.8.1 in /usr/local/lib/python3.7/dist-packages (from gensim>=4.0.0->top2vec) (5.2.1)
Requirement already satisfied: cython>=0.27 in /usr/local/lib/python3.7/dist-packages (from hdbscan>=0.8.27->top2vec) (0.29.32)
Requirement already satisfied: scikit-learn>=0.20 in /usr/local/lib/python3.7/dist-packages (from hdbscan>=0.8.27->top2vec) (1.0.2)
Requirement already satisfied: joblib>=1.0 in /usr/local/lib/python3.7/dist-packages (from hdbscan>=0.8.27->top2vec) (1.1.0)
Requirement already satisfied: threadpoolctl>=2.0.0 in /usr/local/lib/python3.7/dist-packages (from scikit-learn>=0.20->hdbscan>=0.8.27->top2vec) (3.1.0)
Requirement already satisfied: tqdm in /usr/local/lib/python3.7/dist-packages (from umap-learn>=0.5.1->top2vec) (4.64.1)
Requirement already satisfied: numba>=0.49 in /usr/local/lib/python3.7/dist-packages (from umap-learn>=0.5.1->top2vec) (0.56.2)
Requirement already satisfied: pynndescent>=0.5 in /usr/local/lib/python3.7/dist-packages (from umap-learn>=0.5.1->top2vec) (0.5.7)
Requirement already satisfied: importlib-metadata in /usr/local/lib/python3.7/dist-packages (from numba>=0.49->umap-learn>=0.5.1->top2vec) (4.12.0)
Requirement already satisfied: llvmlite<0.40,>=0.39.0dev0 in /usr/local/lib/python3.7/dist-packages (from numba>=0.49->umap-learn>=0.5.1->top2vec) (0.39.1)
Requirement already satisfied: setuptools<60 in /usr/local/lib/python3.7/dist-packages (from numba>=0.49->umap-learn>=0.5.1->top2vec) (57.4.0)
Requirement already satisfied: zipp>=0.5 in /usr/local/lib/python3.7/dist-packages (from importlib-metadata->numba>=0.49->umap-learn>=0.5.1->top2vec) (3.8.1)
Requirement already satisfied: typing-extensions>=3.6.4 in /usr/local/lib/python3.7/dist-packages (from importlib-metadata->numba>=0.49->umap-learn>=0.5.1->top2vec) (4.1.1)
Requirement already satisfied: pytz>=2017.3 in /usr/local/lib/python3.7/dist-packages (from pandas->top2vec) (2022.2.1)
Requirement already satisfied: python-dateutil>=2.7.3 in /usr/local/lib/python3.7/dist-packages (from pandas->top2vec) (2.8.2)
Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.7/dist-packages (from python-dateutil>=2.7.3->pandas->top2vec) (1.15.0)
Requirement already satisfied: pillow in /usr/local/lib/python3.7/dist-packages (from wordcloud->top2vec) (7.1.2)
Requirement already satisfied: matplotlib in /usr/local/lib/python3.7/dist-packages (from wordcloud->top2vec) (3.2.2)
Requirement already satisfied: pyparsing!=2.0.4,!=2.1.2,!=2.1.6,>=2.0.1 in /usr/local/lib/python3.7/dist-packages (from matplotlib->wordcloud->top2vec) (3.0.9)
Requirement already satisfied: cycler>=0.10 in /usr/local/lib/python3.7/dist-packages (from matplotlib->wordcloud->top2vec) (0.11.0)
Requirement already satisfied: kiwisolver>=1.0.1 in /usr/local/lib/python3.7/dist-packages (from matplotlib->wordcloud->top2vec) (1.4.4)
Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Requirement already satisfied: top2vec[sentence_encoders] in /usr/local/lib/python3.7/dist-packages (1.0.27)
Requirement already satisfied: wordcloud in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_encoders]) (1.8.2.2)
Requirement already satisfied: numpy>=1.20.0 in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_encoders]) (1.21.6)
Requirement already satisfied: pandas in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_encoders]) (1.3.5)
Requirement already satisfied: gensim>=4.0.0 in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_encoders]) (4.2.0)
Requirement already satisfied: hdbscan>=0.8.27 in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_encoders]) (0.8.28)
Requirement already satisfied: umap-learn>=0.5.1 in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_encoders]) (0.5.3)
Requirement already satisfied: tensorflow-hub in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_encoders]) (0.12.0)
Requirement already satisfied: tensorflow in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_encoders]) (2.8.2+zzzcolab20220719082949)
Requirement already satisfied: tensorflow-text in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_encoders]) (2.8.2)
Requirement already satisfied: smart-open>=1.8.1 in /usr/local/lib/python3.7/dist-packages (from gensim>=4.0.0->top2vec[sentence_encoders]) (5.2.1)
Requirement already satisfied: scipy>=0.18.1 in /usr/local/lib/python3.7/dist-packages (from gensim>=4.0.0->top2vec[sentence_encoders]) (1.7.3)
Requirement already satisfied: joblib>=1.0 in /usr/local/lib/python3.7/dist-packages (from hdbscan>=0.8.27->top2vec[sentence_encoders]) (1.1.0)
Requirement already satisfied: cython>=0.27 in /usr/local/lib/python3.7/dist-packages (from hdbscan>=0.8.27->top2vec[sentence_encoders]) (0.29.32)
Requirement already satisfied: scikit-learn>=0.20 in /usr/local/lib/python3.7/dist-packages (from hdbscan>=0.8.27->top2vec[sentence_encoders]) (1.0.2)
Requirement already satisfied: threadpoolctl>=2.0.0 in /usr/local/lib/python3.7/dist-packages (from scikit-learn>=0.20->hdbscan>=0.8.27->top2vec[sentence_encoders]) (3.1.0)
Requirement already satisfied: pynndescent>=0.5 in /usr/local/lib/python3.7/dist-packages (from umap-learn>=0.5.1->top2vec[sentence_encoders]) (0.5.7)
Requirement already satisfied: numba>=0.49 in /usr/local/lib/python3.7/dist-packages (from umap-learn>=0.5.1->top2vec[sentence_encoders]) (0.56.2)
Requirement already satisfied: tqdm in /usr/local/lib/python3.7/dist-packages (from umap-learn>=0.5.1->top2vec[sentence_encoders]) (4.64.1)
Requirement already satisfied: setuptools<60 in /usr/local/lib/python3.7/dist-packages (from numba>=0.49->umap-learn>=0.5.1->top2vec[sentence_encoders]) (57.4.0)
Requirement already satisfied: llvmlite<0.40,>=0.39.0dev0 in /usr/local/lib/python3.7/dist-packages (from numba>=0.49->umap-learn>=0.5.1->top2vec[sentence_encoders]) (0.39.1)
Requirement already satisfied: importlib-metadata in /usr/local/lib/python3.7/dist-packages (from numba>=0.49->umap-learn>=0.5.1->top2vec[sentence_encoders]) (4.12.0)
Requirement already satisfied: typing-extensions>=3.6.4 in /usr/local/lib/python3.7/dist-packages (from importlib-metadata->numba>=0.49->umap-learn>=0.5.1->top2vec[sentence_encoders]) (4.1.1)
Requirement already satisfied: zipp>=0.5 in /usr/local/lib/python3.7/dist-packages (from importlib-metadata->numba>=0.49->umap-learn>=0.5.1->top2vec[sentence_encoders]) (3.8.1)
Requirement already satisfied: pytz>=2017.3 in /usr/local/lib/python3.7/dist-packages (from pandas->top2vec[sentence_encoders]) (2022.2.1)
Requirement already satisfied: python-dateutil>=2.7.3 in /usr/local/lib/python3.7/dist-packages (from pandas->top2vec[sentence_encoders]) (2.8.2)
Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.7/dist-packages (from python-dateutil>=2.7.3->pandas->top2vec[sentence_encoders]) (1.15.0)
Requirement already satisfied: grpcio<2.0,>=1.24.3 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (1.48.1)
Requirement already satisfied: protobuf<3.20,>=3.9.2 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (3.17.3)
Requirement already satisfied: tensorboard<2.9,>=2.8 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (2.8.0)
Requirement already satisfied: gast>=0.2.1 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (0.5.3)
Requirement already satisfied: libclang>=9.0.1 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (14.0.6)
Requirement already satisfied: opt-einsum>=2.3.2 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (3.3.0)
Requirement already satisfied: astunparse>=1.6.0 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (1.6.3)
Requirement already satisfied: wrapt>=1.11.0 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (1.14.1)
Requirement already satisfied: absl-py>=0.4.0 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (1.2.0)
Requirement already satisfied: keras-preprocessing>=1.1.1 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (1.1.2)
Requirement already satisfied: termcolor>=1.1.0 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (1.1.0)
Requirement already satisfied: tensorflow-estimator<2.9,>=2.8 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (2.8.0)
Requirement already satisfied: flatbuffers>=1.12 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (2.0.7)
Requirement already satisfied: h5py>=2.9.0 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (3.1.0)
Requirement already satisfied: tensorflow-io-gcs-filesystem>=0.23.1 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (0.26.0)
Requirement already satisfied: google-pasta>=0.1.1 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (0.2.0)
Requirement already satisfied: keras<2.9,>=2.8.0rc0 in /usr/local/lib/python3.7/dist-packages (from tensorflow->top2vec[sentence_encoders]) (2.8.0)
Requirement already satisfied: wheel<1.0,>=0.23.0 in /usr/local/lib/python3.7/dist-packages (from astunparse>=1.6.0->tensorflow->top2vec[sentence_encoders]) (0.37.1)
Requirement already satisfied: cached-property in /usr/local/lib/python3.7/dist-packages (from h5py>=2.9.0->tensorflow->top2vec[sentence_encoders]) (1.5.2)
Requirement already satisfied: markdown>=2.6.8 in /usr/local/lib/python3.7/dist-packages (from tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (3.4.1)
Requirement already satisfied: werkzeug>=0.11.15 in /usr/local/lib/python3.7/dist-packages (from tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (1.0.1)
Requirement already satisfied: requests<3,>=2.21.0 in /usr/local/lib/python3.7/dist-packages (from tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (2.23.0)
Requirement already satisfied: tensorboard-data-server<0.7.0,>=0.6.0 in /usr/local/lib/python3.7/dist-packages (from tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (0.6.1)
Requirement already satisfied: tensorboard-plugin-wit>=1.6.0 in /usr/local/lib/python3.7/dist-packages (from tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (1.8.1)
Requirement already satisfied: google-auth-oauthlib<0.5,>=0.4.1 in /usr/local/lib/python3.7/dist-packages (from tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (0.4.6)
Requirement already satisfied: google-auth<3,>=1.6.3 in /usr/local/lib/python3.7/dist-packages (from tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (1.35.0)
Requirement already satisfied: rsa<5,>=3.1.4 in /usr/local/lib/python3.7/dist-packages (from google-auth<3,>=1.6.3->tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (4.9)
Requirement already satisfied: cachetools<5.0,>=2.0.0 in /usr/local/lib/python3.7/dist-packages (from google-auth<3,>=1.6.3->tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (4.2.4)
Requirement already satisfied: pyasn1-modules>=0.2.1 in /usr/local/lib/python3.7/dist-packages (from google-auth<3,>=1.6.3->tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (0.2.8)
Requirement already satisfied: requests-oauthlib>=0.7.0 in /usr/local/lib/python3.7/dist-packages (from google-auth-oauthlib<0.5,>=0.4.1->tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (1.3.1)
Requirement already satisfied: pyasn1<0.5.0,>=0.4.6 in /usr/local/lib/python3.7/dist-packages (from pyasn1-modules>=0.2.1->google-auth<3,>=1.6.3->tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (0.4.8)
Requirement already satisfied: chardet<4,>=3.0.2 in /usr/local/lib/python3.7/dist-packages (from requests<3,>=2.21.0->tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (3.0.4)
Requirement already satisfied: idna<3,>=2.5 in /usr/local/lib/python3.7/dist-packages (from requests<3,>=2.21.0->tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (2.10)
Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.7/dist-packages (from requests<3,>=2.21.0->tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (2022.6.15)
Requirement already satisfied: urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1 in /usr/local/lib/python3.7/dist-packages (from requests<3,>=2.21.0->tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (1.24.3)
Requirement already satisfied: oauthlib>=3.0.0 in /usr/local/lib/python3.7/dist-packages (from requests-oauthlib>=0.7.0->google-auth-oauthlib<0.5,>=0.4.1->tensorboard<2.9,>=2.8->tensorflow->top2vec[sentence_encoders]) (3.2.0)
Requirement already satisfied: pillow in /usr/local/lib/python3.7/dist-packages (from wordcloud->top2vec[sentence_encoders]) (7.1.2)
Requirement already satisfied: matplotlib in /usr/local/lib/python3.7/dist-packages (from wordcloud->top2vec[sentence_encoders]) (3.2.2)
Requirement already satisfied: pyparsing!=2.0.4,!=2.1.2,!=2.1.6,>=2.0.1 in /usr/local/lib/python3.7/dist-packages (from matplotlib->wordcloud->top2vec[sentence_encoders]) (3.0.9)
Requirement already satisfied: cycler>=0.10 in /usr/local/lib/python3.7/dist-packages (from matplotlib->wordcloud->top2vec[sentence_encoders]) (0.11.0)
Requirement already satisfied: kiwisolver>=1.0.1 in /usr/local/lib/python3.7/dist-packages (from matplotlib->wordcloud->top2vec[sentence_encoders]) (1.4.4)
Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Requirement already satisfied: top2vec[sentence_transformers] in /usr/local/lib/python3.7/dist-packages (1.0.27)
Requirement already satisfied: hdbscan>=0.8.27 in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_transformers]) (0.8.28)
Requirement already satisfied: umap-learn>=0.5.1 in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_transformers]) (0.5.3)
Requirement already satisfied: pandas in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_transformers]) (1.3.5)
Requirement already satisfied: gensim>=4.0.0 in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_transformers]) (4.2.0)
Requirement already satisfied: wordcloud in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_transformers]) (1.8.2.2)
Requirement already satisfied: numpy>=1.20.0 in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_transformers]) (1.21.6)
Requirement already satisfied: torch in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_transformers]) (1.12.1+cu113)
Requirement already satisfied: sentence-transformers in /usr/local/lib/python3.7/dist-packages (from top2vec[sentence_transformers]) (2.2.2)
Requirement already satisfied: smart-open>=1.8.1 in /usr/local/lib/python3.7/dist-packages (from gensim>=4.0.0->top2vec[sentence_transformers]) (5.2.1)
Requirement already satisfied: scipy>=0.18.1 in /usr/local/lib/python3.7/dist-packages (from gensim>=4.0.0->top2vec[sentence_transformers]) (1.7.3)
Requirement already satisfied: joblib>=1.0 in /usr/local/lib/python3.7/dist-packages (from hdbscan>=0.8.27->top2vec[sentence_transformers]) (1.1.0)
Requirement already satisfied: scikit-learn>=0.20 in /usr/local/lib/python3.7/dist-packages (from hdbscan>=0.8.27->top2vec[sentence_transformers]) (1.0.2)
Requirement already satisfied: cython>=0.27 in /usr/local/lib/python3.7/dist-packages (from hdbscan>=0.8.27->top2vec[sentence_transformers]) (0.29.32)
Requirement already satisfied: threadpoolctl>=2.0.0 in /usr/local/lib/python3.7/dist-packages (from scikit-learn>=0.20->hdbscan>=0.8.27->top2vec[sentence_transformers]) (3.1.0)
Requirement already satisfied: pynndescent>=0.5 in /usr/local/lib/python3.7/dist-packages (from umap-learn>=0.5.1->top2vec[sentence_transformers]) (0.5.7)
Requirement already satisfied: numba>=0.49 in /usr/local/lib/python3.7/dist-packages (from umap-learn>=0.5.1->top2vec[sentence_transformers]) (0.56.2)
Requirement already satisfied: tqdm in /usr/local/lib/python3.7/dist-packages (from umap-learn>=0.5.1->top2vec[sentence_transformers]) (4.64.1)
Requirement already satisfied: importlib-metadata in /usr/local/lib/python3.7/dist-packages (from numba>=0.49->umap-learn>=0.5.1->top2vec[sentence_transformers]) (4.12.0)
Requirement already satisfied: llvmlite<0.40,>=0.39.0dev0 in /usr/local/lib/python3.7/dist-packages (from numba>=0.49->umap-learn>=0.5.1->top2vec[sentence_transformers]) (0.39.1)
Requirement already satisfied: setuptools<60 in /usr/local/lib/python3.7/dist-packages (from numba>=0.49->umap-learn>=0.5.1->top2vec[sentence_transformers]) (57.4.0)
Requirement already satisfied: typing-extensions>=3.6.4 in /usr/local/lib/python3.7/dist-packages (from importlib-metadata->numba>=0.49->umap-learn>=0.5.1->top2vec[sentence_transformers]) (4.1.1)
Requirement already satisfied: zipp>=0.5 in /usr/local/lib/python3.7/dist-packages (from importlib-metadata->numba>=0.49->umap-learn>=0.5.1->top2vec[sentence_transformers]) (3.8.1)
Requirement already satisfied: python-dateutil>=2.7.3 in /usr/local/lib/python3.7/dist-packages (from pandas->top2vec[sentence_transformers]) (2.8.2)
Requirement already satisfied: pytz>=2017.3 in /usr/local/lib/python3.7/dist-packages (from pandas->top2vec[sentence_transformers]) (2022.2.1)
Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.7/dist-packages (from python-dateutil>=2.7.3->pandas->top2vec[sentence_transformers]) (1.15.0)
Requirement already satisfied: nltk in /usr/local/lib/python3.7/dist-packages (from sentence-transformers->top2vec[sentence_transformers]) (3.7)
Requirement already satisfied: transformers<5.0.0,>=4.6.0 in /usr/local/lib/python3.7/dist-packages (from sentence-transformers->top2vec[sentence_transformers]) (4.22.1)
Requirement already satisfied: sentencepiece in /usr/local/lib/python3.7/dist-packages (from sentence-transformers->top2vec[sentence_transformers]) (0.1.97)
Requirement already satisfied: torchvision in /usr/local/lib/python3.7/dist-packages (from sentence-transformers->top2vec[sentence_transformers]) (0.13.1+cu113)
Requirement already satisfied: huggingface-hub>=0.4.0 in /usr/local/lib/python3.7/dist-packages (from sentence-transformers->top2vec[sentence_transformers]) (0.9.1)
Requirement already satisfied: requests in /usr/local/lib/python3.7/dist-packages (from huggingface-hub>=0.4.0->sentence-transformers->top2vec[sentence_transformers]) (2.23.0)
Requirement already satisfied: packaging>=20.9 in /usr/local/lib/python3.7/dist-packages (from huggingface-hub>=0.4.0->sentence-transformers->top2vec[sentence_transformers]) (21.3)
Requirement already satisfied: filelock in /usr/local/lib/python3.7/dist-packages (from huggingface-hub>=0.4.0->sentence-transformers->top2vec[sentence_transformers]) (3.8.0)
Requirement already satisfied: pyyaml>=5.1 in /usr/local/lib/python3.7/dist-packages (from huggingface-hub>=0.4.0->sentence-transformers->top2vec[sentence_transformers]) (6.0)
Requirement already satisfied: pyparsing!=3.0.5,>=2.0.2 in /usr/local/lib/python3.7/dist-packages (from packaging>=20.9->huggingface-hub>=0.4.0->sentence-transformers->top2vec[sentence_transformers]) (3.0.9)
Requirement already satisfied: tokenizers!=0.11.3,<0.13,>=0.11.1 in /usr/local/lib/python3.7/dist-packages (from transformers<5.0.0,>=4.6.0->sentence-transformers->top2vec[sentence_transformers]) (0.12.1)
Requirement already satisfied: regex!=2019.12.17 in /usr/local/lib/python3.7/dist-packages (from transformers<5.0.0,>=4.6.0->sentence-transformers->top2vec[sentence_transformers]) (2022.6.2)
Requirement already satisfied: click in /usr/local/lib/python3.7/dist-packages (from nltk->sentence-transformers->top2vec[sentence_transformers]) (7.1.2)
Requirement already satisfied: idna<3,>=2.5 in /usr/local/lib/python3.7/dist-packages (from requests->huggingface-hub>=0.4.0->sentence-transformers->top2vec[sentence_transformers]) (2.10)
Requirement already satisfied: chardet<4,>=3.0.2 in /usr/local/lib/python3.7/dist-packages (from requests->huggingface-hub>=0.4.0->sentence-transformers->top2vec[sentence_transformers]) (3.0.4)
Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.7/dist-packages (from requests->huggingface-hub>=0.4.0->sentence-transformers->top2vec[sentence_transformers]) (2022.6.15)
Requirement already satisfied: urllib3!=1.25.0,!=1.25.1,<1.26,>=1.21.1 in /usr/local/lib/python3.7/dist-packages (from requests->huggingface-hub>=0.4.0->sentence-transformers->top2vec[sentence_transformers]) (1.24.3)
Requirement already satisfied: pillow!=8.3.*,>=5.3.0 in /usr/local/lib/python3.7/dist-packages (from torchvision->sentence-transformers->top2vec[sentence_transformers]) (7.1.2)
Requirement already satisfied: matplotlib in /usr/local/lib/python3.7/dist-packages (from wordcloud->top2vec[sentence_transformers]) (3.2.2)
Requirement already satisfied: cycler>=0.10 in /usr/local/lib/python3.7/dist-packages (from matplotlib->wordcloud->top2vec[sentence_transformers]) (0.11.0)
Requirement already satisfied: kiwisolver>=1.0.1 in /usr/local/lib/python3.7/dist-packages (from matplotlib->wordcloud->top2vec[sentence_transformers]) (1.4.4)
Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Requirement already satisfied: top2vec[indexing] in /usr/local/lib/python3.7/dist-packages (1.0.27)
Requirement already satisfied: pandas in /usr/local/lib/python3.7/dist-packages (from top2vec[indexing]) (1.3.5)
Requirement already satisfied: gensim>=4.0.0 in /usr/local/lib/python3.7/dist-packages (from top2vec[indexing]) (4.2.0)
Requirement already satisfied: hdbscan>=0.8.27 in /usr/local/lib/python3.7/dist-packages (from top2vec[indexing]) (0.8.28)
Requirement already satisfied: wordcloud in /usr/local/lib/python3.7/dist-packages (from top2vec[indexing]) (1.8.2.2)
Requirement already satisfied: umap-learn>=0.5.1 in /usr/local/lib/python3.7/dist-packages (from top2vec[indexing]) (0.5.3)
Requirement already satisfied: numpy>=1.20.0 in /usr/local/lib/python3.7/dist-packages (from top2vec[indexing]) (1.21.6)
Requirement already satisfied: hnswlib in /usr/local/lib/python3.7/dist-packages (from top2vec[indexing]) (0.6.2)
Requirement already satisfied: scipy>=0.18.1 in /usr/local/lib/python3.7/dist-packages (from gensim>=4.0.0->top2vec[indexing]) (1.7.3)
Requirement already satisfied: smart-open>=1.8.1 in /usr/local/lib/python3.7/dist-packages (from gensim>=4.0.0->top2vec[indexing]) (5.2.1)
Requirement already satisfied: joblib>=1.0 in /usr/local/lib/python3.7/dist-packages (from hdbscan>=0.8.27->top2vec[indexing]) (1.1.0)
Requirement already satisfied: scikit-learn>=0.20 in /usr/local/lib/python3.7/dist-packages (from hdbscan>=0.8.27->top2vec[indexing]) (1.0.2)
Requirement already satisfied: cython>=0.27 in /usr/local/lib/python3.7/dist-packages (from hdbscan>=0.8.27->top2vec[indexing]) (0.29.32)
Requirement already satisfied: threadpoolctl>=2.0.0 in /usr/local/lib/python3.7/dist-packages (from scikit-learn>=0.20->hdbscan>=0.8.27->top2vec[indexing]) (3.1.0)
Requirement already satisfied: numba>=0.49 in /usr/local/lib/python3.7/dist-packages (from umap-learn>=0.5.1->top2vec[indexing]) (0.56.2)
Requirement already satisfied: tqdm in /usr/local/lib/python3.7/dist-packages (from umap-learn>=0.5.1->top2vec[indexing]) (4.64.1)
Requirement already satisfied: pynndescent>=0.5 in /usr/local/lib/python3.7/dist-packages (from umap-learn>=0.5.1->top2vec[indexing]) (0.5.7)
Requirement already satisfied: llvmlite<0.40,>=0.39.0dev0 in /usr/local/lib/python3.7/dist-packages (from numba>=0.49->umap-learn>=0.5.1->top2vec[indexing]) (0.39.1)
Requirement already satisfied: setuptools<60 in /usr/local/lib/python3.7/dist-packages (from numba>=0.49->umap-learn>=0.5.1->top2vec[indexing]) (57.4.0)
Requirement already satisfied: importlib-metadata in /usr/local/lib/python3.7/dist-packages (from numba>=0.49->umap-learn>=0.5.1->top2vec[indexing]) (4.12.0)
Requirement already satisfied: zipp>=0.5 in /usr/local/lib/python3.7/dist-packages (from importlib-metadata->numba>=0.49->umap-learn>=0.5.1->top2vec[indexing]) (3.8.1)
Requirement already satisfied: typing-extensions>=3.6.4 in /usr/local/lib/python3.7/dist-packages (from importlib-metadata->numba>=0.49->umap-learn>=0.5.1->top2vec[indexing]) (4.1.1)
Requirement already satisfied: python-dateutil>=2.7.3 in /usr/local/lib/python3.7/dist-packages (from pandas->top2vec[indexing]) (2.8.2)
Requirement already satisfied: pytz>=2017.3 in /usr/local/lib/python3.7/dist-packages (from pandas->top2vec[indexing]) (2022.2.1)
Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.7/dist-packages (from python-dateutil>=2.7.3->pandas->top2vec[indexing]) (1.15.0)
Requirement already satisfied: pillow in /usr/local/lib/python3.7/dist-packages (from wordcloud->top2vec[indexing]) (7.1.2)
Requirement already satisfied: matplotlib in /usr/local/lib/python3.7/dist-packages (from wordcloud->top2vec[indexing]) (3.2.2)
Requirement already satisfied: cycler>=0.10 in /usr/local/lib/python3.7/dist-packages (from matplotlib->wordcloud->top2vec[indexing]) (0.11.0)
Requirement already satisfied: pyparsing!=2.0.4,!=2.1.2,!=2.1.6,>=2.0.1 in /usr/local/lib/python3.7/dist-packages (from matplotlib->wordcloud->top2vec[indexing]) (3.0.9)
Requirement already satisfied: kiwisolver>=1.0.1 in /usr/local/lib/python3.7/dist-packages (from matplotlib->wordcloud->top2vec[indexing]) (1.4.4)
Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Collecting yake
  Downloading yake-0.4.8-py2.py3-none-any.whl (60 kB)
     |████████████████████████████████| 60 kB 1.7 MB/s 
Requirement already satisfied: numpy in /usr/local/lib/python3.7/dist-packages (from yake) (1.21.6)
Requirement already satisfied: click>=6.0 in /usr/local/lib/python3.7/dist-packages (from yake) (7.1.2)
Requirement already satisfied: networkx in /usr/local/lib/python3.7/dist-packages (from yake) (2.6.3)
Collecting segtok
  Downloading segtok-1.5.11-py3-none-any.whl (24 kB)
Requirement already satisfied: tabulate in /usr/local/lib/python3.7/dist-packages (from yake) (0.8.10)
Collecting jellyfish
  Downloading jellyfish-0.9.0.tar.gz (132 kB)
     |████████████████████████████████| 132 kB 4.2 MB/s 
Requirement already satisfied: regex in /usr/local/lib/python3.7/dist-packages (from segtok->yake) (2022.6.2)
Building wheels for collected packages: jellyfish
  Building wheel for jellyfish (setup.py) ... done
  Created wheel for jellyfish: filename=jellyfish-0.9.0-cp37-cp37m-linux_x86_64.whl size=73982 sha256=8935da914e689aa323c085f347b4ef61734d1c61dc255fbcf90ed30459ef551c
  Stored in directory: /root/.cache/pip/wheels/fe/99/4e/646ce766df0d070b0ef04db27aa11543e2767fda3075aec31b
Successfully built jellyfish
Installing collected packages: segtok, jellyfish, yake
Successfully installed jellyfish-0.9.0 segtok-1.5.11 yake-0.4.8
In [18]:
import pandas as pd
import numpy as np
import json
import os
import uuid
from top2vec import Top2Vec
The cache for model files in Transformers v4.22.0 has been updated. Migrating your old cache. This is a one-time only operation. You can interrupt this and resume the migration later on by calling `transformers.utils.move_cache()`.
Moving 0 files to the new cache system
In [14]:
df  = pd.read_csv('data job posts.csv', error_bad_lines=False, encoding='utf-8')
/usr/local/lib/python3.7/dist-packages/IPython/core/interactiveshell.py:3326: FutureWarning: The error_bad_lines argument has been deprecated and will be removed in a future version.


  exec(code_obj, self.user_global_ns, self.user_ns)
In [15]:
df.head(2)
Out[15]:
jobpost date Title Company AnnouncementCode Term Eligibility Audience StartDate Duration ... Salary ApplicationP OpeningDate Deadline Notes AboutC Attach Year Month IT
0 AMERIA Investment Consulting Company\r\nJOB TI... Jan 5, 2004 Chief Financial Officer AMERIA Investment Consulting Company NaN NaN NaN NaN NaN NaN ... NaN To apply for this position, please submit a\r\... NaN 26 January 2004 NaN NaN NaN 2004 1 False
1 International Research & Exchanges Board (IREX... Jan 7, 2004 Full-time Community Connections Intern (paid i... International Research & Exchanges Board (IREX) NaN NaN NaN NaN NaN 3 months ... NaN Please submit a cover letter and resume to:\r\... NaN 12 January 2004 NaN The International Research & Exchanges Board (... NaN 2004 1 False

2 rows × 24 columns

In [16]:
df_analyze = df[["jobpost", "Title"]]
In [17]:
df_analyze.head(2)
Out[17]:
jobpost Title
0 AMERIA Investment Consulting Company\r\nJOB TI... Chief Financial Officer
1 International Research & Exchanges Board (IREX... Full-time Community Connections Intern (paid i...
In [20]:
documents = df["jobpost"].to_list()
In [21]:
model = Top2Vec(documents, embedding_model='universal-sentence-encoder')
2022-09-17 14:59:46,664 - top2vec - INFO - Pre-processing documents for training
INFO:top2vec:Pre-processing documents for training
/usr/local/lib/python3.7/dist-packages/sklearn/utils/deprecation.py:87: FutureWarning: Function get_feature_names is deprecated; get_feature_names is deprecated in 1.0 and will be removed in 1.2. Please use get_feature_names_out instead.
  warnings.warn(msg, category=FutureWarning)
2022-09-17 15:00:12,008 - top2vec - INFO - Downloading universal-sentence-encoder model
INFO:top2vec:Downloading universal-sentence-encoder model
2022-09-17 15:00:38,560 - top2vec - INFO - Creating joint document/word embedding
INFO:top2vec:Creating joint document/word embedding
2022-09-17 15:01:15,005 - top2vec - INFO - Creating lower dimension embedding of documents
INFO:top2vec:Creating lower dimension embedding of documents
2022-09-17 15:02:05,503 - top2vec - INFO - Finding dense areas of documents
INFO:top2vec:Finding dense areas of documents
2022-09-17 15:02:09,646 - top2vec - INFO - Finding topics
INFO:top2vec:Finding topics
In [22]:
model.get_num_topics()
Out[22]:
58
In [34]:
topic_words, word_scores, topic_scores, topic_nums = model.search_topics(keywords=["software"], num_topics=3)
for topic in topic_nums:
    model.generate_topic_wordcloud(topic)
In [39]:
documents, document_scores, document_ids = model.search_documents_by_topic(topic_num=5, num_docs=3)
for doc, score, doc_id in zip(documents, document_scores, document_ids):
    print(f"Document: {doc_id}, Score: {score}")
    print("-----------")
    print(doc)
    print("-----------")
    print()
Document: 11086, Score: 0.9887646436691284
-----------
Ardinnotech LLC
TITLE:  Software Developer
LOCATION:  Yerevan, Armenia
JOB DESCRIPTION:  The incumbent will develop software systems according
to clients' requirements and continue debugging, enhancing and
maintaining developed systems. Primary responsibilities include data
analyzing skills in combination with developing and debugging database
driven web based enterprise level software using C# programming language
and SQL.
JOB RESPONSIBILITIES:
- Develop software code based on the existing design;
- Be flexible in learning and applying different programming languages
and technologies;
- Constantly update technical knowledge and skills;
- Be efficient in the use of memory and CPU time;
- Employ a good code writing style that includes proper naming and
sufficient comments;
- Update/ write technical documentation on codes and algorithms;
- Work productively as a key member of a software development team;
- Actively participate in discussions regarding technical issues;
- Possibility to travel to USA or other countries to work with other team
members.
REQUIRED QUALIFICATIONS:
- At least 2 years of work experience with C#/.NET;
- Bachelor's or higher degree in Computer Sciences, Physics, Mathematics
or in related disciplines;
- Good grades in undergraduate and other studies;
- Strong problem-solving skills and ability to be a successful member of
a team;
- Strong knowledge of C#, Microsoft .Net Framework 4.0, Silverlight,
ASP.NET MVC3, EF and WCF;
- Working experience with programming user interfaces;
- Working experience with relational databases and SQL language;
- Ability to develop efficient codes;
- Familiarity with algorithms and ability to estimate algorithm
complexity;
- Familiarity with data structures;
- Good English language skills.
Desired qualifications:
- Familiarity with Java, C++ or other object oriented programming
languages;
- Familiarity with Javascript and related frameworks (JQuery and YUI);
- Ability to write programs using different programming languages and
environments;
- Familiarity with different web development technologies (AJAX, HTML and
CSS);
- Familiarity with architectural and design patterns.
REMUNERATION/ SALARY:  Highly competitive, based on background and
experience.
APPLICATION PROCEDURES:  Interested candidates should e-mail their CVs
and inquiries to: Armen Nahapetyan at: nahapetyan.armen@... .
Please clearly mention in your application letter that you learned of
this job opportunity through Career Center and mention the URL of its
website - www.careercenter.am, Thanks.
OPENING DATE:  14 February 2012
APPLICATION DEADLINE:  13 March 2012
ABOUT COMPANY:  The head office of Ardinnotech LLC is based in Florida,
USA. The
company specializes in development of computer models and software for
railroad and transportation companies.
----------------------------------
To place a free posting for job or other career-related opportunities
available in your organization, just go to the www.careercenter.am
website and follow the "Post an Announcement" link.
-----------

Document: 15575, Score: 0.9851416349411011
-----------
Ardinnotech LLC
TITLE:  Software Developer
TERM:  Full time
LOCATION:  Yerevan, Armenia
JOB DESCRIPTION:  The incumbent will develop software systems according
to clients' requirements, and continue debugging, enhancing and
maintaining developed systems. Primary responsibilities include data
analyzing skills in combination with developing and debugging database
driven web based enterprise level software using C# programming language
and SQL.
JOB RESPONSIBILITIES:
- Develop software code based on the existing design;
- Constantly update technical knowledge and skills;
- Employ a good code writing style that includes proper naming and
sufficient comments;
- Update/ write technical documentation on codes and algorithms;
- Work productively as a key member of a software development team;
- Actively participate in discussions regarding technical issues.
REQUIRED QUALIFICATIONS:
- At least 1 year of working experience with C#/.NET;
- Bachelor's or higher degree in Computer Sciences, Physics, Mathematics
or related disciplines;
- Good grades in undergraduate and other studies;
- Strong problem-solving skills and ability to be a successful member of
a team;
- Strong knowledge of C#, Microsoft .Net Framework 4.0, Silverlight,
ASP.NET MVC3, EF, WCF;
- Working experience with MVC and MVVM patterns;
- Working experience with programming user interfaces;
- Working experience with relational databases and SQL language;
- Ability to develop efficient codes;
- Familiarity with algorithms, ability to estimate algorithm complexity;
- Familiarity with data structures;
- Good English language skills including reading, writing, and speaking;
- Flexibility in learning and applying different programming languages
and technologies;
- Efficiency in the use of memory and CPU time;
- Ability to travel to USA or other countries to work with other team
members.
DESIRED QUALIFICATIONS:
- Familiarity with Java, C++, or other object oriented programming
languages;
- Familiarity with Javascript and related frameworks (JQuery, YUI,
etc.);
- Ability to write programs using different programming languages and
environments;
- Familiarity with different web development technologies (AJAX, HTML,
CSS, etc.);
- Familiarity with architectural and design patterns;
REMUNERATION/ SALARY:  Highly competitive salary based on tests and
interview results, experience.
APPLICATION PROCEDURES:  Interested candidates should email their CVs and
inquiries to: nahapetyan.armen@... , to Armen Nahapetyan.
Please clearly mention in your application letter that you learned of
this job opportunity through Career Center and mention the URL of its
website - www.careercenter.am, Thanks.
OPENING DATE:  09 April 2014
APPLICATION DEADLINE:  08 May 2014
ABOUT COMPANY:  Ardinnotech's head office is based in Florida, USA. The
main specialization of the Company is developing and licensing
large-scale computer models and software.
----------------------------------
To place a free posting for job or other career-related opportunities
available in your organization, just go to the www.careercenter.am
website and follow the "Post an Announcement" link.
-----------

Document: 16093, Score: 0.9846581220626831
-----------
Ardinnotech LLC
TITLE:  Software Developer
TERM:  Full time
LOCATION:  Yerevan, Armenia
JOB DESCRIPTION:  The incumbent will develop software systems according
to clients' requirements, and continue debugging, enhancing and
maintaining developed systems. Primary responsibilities include data
analyzing skills in combination with developing and debugging database
driven web based enterprise level software using C# programming language
and SQL.
JOB RESPONSIBILITIES:
- Develop software code based on the existing design;
- Constantly update technical knowledge and skills;
- Employ a good code writing style that includes proper naming and
sufficient comments;
- Update/ write technical documentation on codes and algorithms;
- Work productively as a key member of a software development team;
- Actively participate in discussions regarding technical issues.
REQUIRED QUALIFICATIONS:
- At least 1 year of working experience with C#/ .NET;
- Bachelor's or higher degree in Computer Sciences, Physics, Mathematics
or related disciplines;
- Good grades in undergraduate and other studies;
- Strong problem-solving skills and ability to be a successful member of
a team;
- Strong knowledge of C#, Microsoft .Net Framework 4.0, Silverlight,
ASP.NET MVC3, EF, WCF;
- Working experience with MVC and MVVM patterns;
- Working experience with programming user interfaces;
- Working experience with relational databases and SQL language;
- Ability to develop efficient codes;
- Familiarity with algorithms, ability to estimate algorithm complexity;
- Familiarity with data structures;
- Good English language skills including reading, writing, and speaking;
- Flexibility in learning and applying different programming languages
and technologies;
- Efficiency in the use of memory and CPU time;
- Ability to travel to USA or other countries to work with other team
members.
DESIRED QUALIFICATIONS:
- Familiarity with Java, C++, or other object oriented programming
languages;
- Familiarity with Javascript and related frameworks (JQuery, YUI,
etc.);
- Ability to write programs using different programming languages and
environments;
- Familiarity with different web development technologies (AJAX, HTML,
CSS, etc.);
- Familiarity with architectural and design patterns.
REMUNERATION/ SALARY:  Highly competitive salary based on tests and
interview results, experience.
APPLICATION PROCEDURES:  Interested candidates should email their CVs and
inquiries to: nahapetyan.armen@... , to Armen Nahapetyan.
Please clearly mention in your application letter that you learned of
this job opportunity through Career Center and mention the URL of its
website - www.careercenter.am, Thanks.
OPENING DATE:  09 July 2014
APPLICATION DEADLINE:  08 August 2014
ABOUT COMPANY:  Ardinnotech's head office is based in Florida, USA. The
main specialization of the company is developing and licensing
large-scale computer models and software.
----------------------------------
To place a free posting for job or other career-related opportunities
available in your organization, just go to the www.careercenter.am
website and follow the "Post an Announcement" link.
-----------

In [40]:
words, word_scores = model.similar_words(keywords=["software"], keywords_neg=[], num_words=5)
for word, score in zip(words, word_scores):
    print(f"{word} {score}")
program 0.6808494586077558
programme 0.6437439214069385
computer 0.6267092955854692
programs 0.620686473209616
programmes 0.5974383736481907
In [99]:
INDEED_SAMPLE_JOB = """
ull Job Description
Your Job

If you are an engineer who's passionate about building impactful products that scale to tens of millions of page views a day, Indeed is looking for you. Indeed offers skilled developers like you a complex development ecosystem with short release cycles. Every week sees the new release of multiple products that meet the needs of job seekers worldwide.

The Signals and Routing team is a sub-team within Business Enablement in the SMB platform. The core of its functionality is around determining eligibility of sales leads assignments and routing that leads to the appropriate global sales team.Tech stack is predominantly backend development with Java, Spring Boot, and Kafka. There is a very small amount of React used for front ends for operations team members.

Responsibilities:
Design, develop, and maintain pipelines that create search indexes out of data from large databases
Design and implement efficient algorithms for ranking search results, increasing recall, filtering spam and personalization
Extract, cluster or classify high-quality features from unstructured data
Continually improve search quality and performance
Code innovative tools to support rapid experimentation and learning
Who You Are
Bachelor’s degree required in Computer Science, Electrical Engineering, Computer Engineering or Mathematics or equivalent field required
3+ years experience in Java, Python, or C++ - Java preferred
Spring Boot, React and Kafka Nice to have, but not required
Who we are
The Small to Medium Businesses (SMB) organization at Indeed develops products centered around the hiring needs of SMB businesses, making the hiring process simpler, faster and more effective so they always find the talent that is right for their business, when they need it. The solutions we provide are focused on three main pillars, including ‘Building for the Long-Term’, ‘Delivering the Hire’ and ‘Driving Performance’. The impact of our work helps grow companies and communities around the world.
Our Mission
As the world’s number 1 job site*, our mission is to help people get jobs. We strive to cultivate an inclusive and accessible workplace where all people feel comfortable being themselves. We're looking to grow our teams with more people who share our enthusiasm for innovation and creating the best experience for job seekers.

(*comScore Total Visits, September 2021)
Salary Range Disclaimer
The base salary range represents the low and high end of the Indeed salary range for this position. Actual salaries will vary depending on factors including but not limited to location, experience, and performance. The range listed is just one component of Indeed's total compensation package for employees. Other rewards may include quarterly bonuses, Restricted Stock Units (RSUs), an open Paid Time Off policy, and many region-specific benefits.
Salary Range Transparency
US Remote 109,000 - 159,000 USD per year

Austin 109,000 - 159,000 USD per year

NYC Metro Area 134,000 - 194,000 USD per year

SF Bay Area 140,000 - 204,000 USD per year

Seattle 132,000 - 192,000 USD per year

Equal Opportunities and Accommodations Statement
Indeed is deeply committed to building a workplace and global community where inclusion is not only valued, but prioritized. We’re proud to be an equal opportunity employer, seeking to create a welcoming and diverse environment. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, family status, marital status, sexual orientation, national origin, genetics, neuro-diversity, disability, age, or veteran status, or any other non-merit based or legally protected grounds.

Indeed is committed to providing reasonable accommodations to qualified individuals with disabilities in the employment application process. To request an accommodation, please contact Talent Attraction Help at 1-855-567-7767, or by email at TAhelp@indeed.com at least one week in advance of your interview.

Fair Chance Hiring
We value diverse experiences, including those who have had prior contact with the criminal legal system. We are committed to providing individuals with criminal records, including formerly incarcerated individuals, a fair chance at employment.

Our Policies and Benefits
View Indeed's Applicant Privacy and Accessibility Policies - https://www.indeed.com/legal/indeed-jobs
Learn about our global employee perks, programs and benefits - https://benefits.indeed.jobs/

Where legally permitted, Indeed requires all individuals attending or working out of Indeed offices or visiting Indeed clients to be fully vaccinated against COVID-19. For positions that can only be performed at an Indeed office, candidates must be fully vaccinated against COVID-19 and present acceptable proof of vaccination by the date of hire as a condition of employment. For positions that require some in-office work or in-person client meetings, exceptions to these in-office or in-person job requirements may be made at the discretion of the business through June 2022, at which point full vaccination will be required. Indeed will consider requests for reasonable accommodation as required under applicable law. To qualify as being fully vaccinated against COVID-19 there should have been a two week period after receiving the second dose (or any government recommended booster shot) in a 2-dose COVID-19 vaccine series, or a two week period after receiving a single-dose (or any government recommended booster shot) in a single dose COVID-19 vaccine.
"""
In [100]:
from keybert import KeyBERT
kw_model = KeyBERT()
keywords = kw_model.extract_keywords(INDEED_SAMPLE_JOB)
print(kw_model.extract_keywords(INDEED_SAMPLE_JOB, keyphrase_ngram_range=(1, 1), stop_words=None))
array = [k for k,score in keywords]
print("array", array)
[('jobs', 0.4386), ('developers', 0.4344), ('tech', 0.4157), ('engineer', 0.4094), ('engineering', 0.3841)]
array ['jobs', 'developers', 'tech', 'engineer', 'engineering']
In [101]:
topic_words, word_scores, topic_scores, topic_nums = model.search_topics(keywords=array, num_topics=3)
for topic in topic_nums:
    model.generate_topic_wordcloud(topic)
In [74]:
documents, document_scores, document_ids = model.search_documents_by_topic(topic_num=45, num_docs=2)
for doc, score, doc_id in zip(documents, document_scores, document_ids):
    print(f"Document: {doc_id}, Score: {score}")
    print("-----------")
    print(doc)
    print("-----------")
    print()
Document: 17776, Score: 0.9942688345909119
-----------
Career Center Partner Company
TITLE:  Embedded Linux BSP Engineer
TERM:  Full time
START DATE/ TIME:  ASAP
LOCATION:  Yerevan, Armenia
JOB DESCRIPTION:  We are seeking several candidates for the Embedded
Linux BSP Engineer position to join our partner company's Board Support
Package (BSP) development / board bring-up team. This team is responsible
for performing initial board bring-up on new hardware, porting board
support packages from reference designs, and developing custom device
drivers.
JOB RESPONSIBILITIES:
- Cooperate with design and test engineers throughout the product
development cycle;
- Work with embedded hardware and system engineers to define and design
system solutions;
- Customize boot-loader and Linux operating system for embedded
platforms;
- Review hardware schematics and establish methods to support the
hardware design with low-level software components;
- Work with senior staff to resolve design problems;
- Contribute to technical documentation;
- Integrate software and hardware components to meet requirements and
design specifications.
REQUIRED QUALIFICATIONS:
- Bachelors degree in Computer Sciences or Electrical Engineering
(Masters degree a plus);
- At least 5 years of experience in embedded software development;
- Hands-on software development with the C programming language;
- Energetic and comfortable personality;
- Ability to work in a fast-paced environment;
- Strong written and verbal communication skills;
- Knowledge of using multi-core processors is a plus;
- Experience with PowerPC, ARM and MIPS 32-bit;
- Working knowledge of Linux driver development;
- Working knowledge of the Linux File System, how to build a root file
system and building Linux distributions;
- Working knowledge of u-boot;
- Some experience in hardware engineering a plus.
REMUNERATION/ SALARY:  800,000 - 1,200,000 AMD
APPLICATION PROCEDURES:  To apply for this position, please register in
www.careerhouse.com website as a regular user (unless you have previously
registered), fill out your Resume (including your recent Photo), then
open this announcement, click on the "Apply" button in the upper right
corner of the announcement and proceed to receive the confirmation
message.
Only short listed applicants will be contacted for the interview.
Please clearly mention in your application letter that you learned of
this job opportunity through Career Center and mention the URL of its
website - www.careercenter.am, Thanks.
OPENING DATE:  25 May 2015
APPLICATION DEADLINE:  08 June 2015
ABOUT COMPANY:  Our Partner Company is a board level design and
manufacturing company focused on quick-turn designs for the embedded
market. The company dedicates itself to introducing leading edge
technology for the most demanding applications, and it differentiates
itself with its extraordinary capability to rapidly create new designs in
response to complex customer requirements.
----------------------------------
To place a free posting for job or other career-related opportunities
available in your organization, just go to the www.careercenter.am
website and follow the "Post an Announcement" link.
-----------

Document: 16642, Score: 0.994168221950531
-----------
Career Center Partner Company
TITLE:  Embedded Linux BSP Engineer
TERM:  Full-time
START DATE/ TIME:  ASAP
LOCATION:  Yerevan, Armenia
JOB DESCRIPTION:  We are seeking several candidates for the Embedded
Linux BSP Engineer position to join our partner company's Board Support
Package (BSP) development / board bring-up team. This team is responsible
for performing initial board bring-up on new hardware, porting board
support packages from reference designs, and developing custom device
drivers.
JOB RESPONSIBILITIES:
- Cooperate with design and test engineers throughout the product
development cycle;
- Work with embedded hardware and system engineers to define and design
system solutions;
- Customize boot-loader and Linux operating system for embedded
platforms;
- Review hardware schematics and establish methods to support the
hardware design with low-level software components;
- Work with senior staff to resolve design problems;
- Contribute to technical documentation;
- Integrate software and hardware components to meet requirements and
design specifications.
REQUIRED QUALIFICATIONS:
- Bachelors degree in Computer Sciences or Electrical Engineering
(Masters degree a plus);
- At least 5 years of experience in embedded software development;
- Hands-on software development with the C programming language;
- Energetic and comfortable personality;
- Ability to work in a fast-paced environment;
- Strong written and verbal communication skills;
- Knowledge of using multi-core processors is a plus;
- Experience with PowerPC, ARM and MIPS 32-bit;
- Working knowledge of Linux driver development;
- Working knowledge of the Linux File System, how to build a root file
system and building Linux distributions;
- Working knowledge of u-boot;
- Some experience in hardware engineering a plus.
REMUNERATION/ SALARY:  800,000 - 1,200,000 AMD
APPLICATION PROCEDURES:  To apply for this position, please register in
www.careerhouse.com website as a regular user (unless you have previously
registered), fill out your Resume (including your recent Photo), then
open this announcement, click on the "Apply" button in the upper right
corner of the announcement and proceed to receive the confirmation
message.
Only short listed applicants will be contacted for the interview.
Please clearly mention in your application letter that you learned of
this job opportunity through Career Center and mention the URL of its
website - www.careercenter.am, Thanks.
OPENING DATE:  22 October 2014
APPLICATION DEADLINE:  05 November 2014
ABOUT COMPANY:  Our Partner Company is a board level design and
manufacturing company focused on quick-turn designs for the embedded
market. The company dedicates itself to introducing leading edge
technology for the most demanding applications, and it differentiates
itself with its extraordinary capability to rapidly create new designs in
response to complex customer requirements.
----------------------------------
To place a free posting for job or other career-related opportunities
available in your organization, just go to the www.careercenter.am
website and follow the "Post an Announcement" link.
-----------

Developer Guide: Getting Started with Flink (PyFlink) and Hudi - Setting Up Your Local Environment and Performing CRUD Operations via flink

flink-hudi-final Install Flink and Python ¶ conda info --envs # Create ENV conda ...