Topic modeling
51 USDUse data from https://github.com/thedenaas/hse_seminars/tree/master/2018/seminar_13/data.zip
Implement model in pytorch from "An Unsupervised Neural Attention Model for Aspect Extraction, He et al, 2017", also desribed in https://github.com/thedenaas/hse_seminars/blob/master/2019/seminar_13/topic.ipynb .
You can use sentence embeddings with attention :
sentence embedding
attention weight for i-th token
attention with trainable matrix 
sentence context
, token embedding of size d
- number of tokens in a sentence
Or just use sentence embedding as an average over word embeddings :
sentence embedding
, token embedding of size d
- number of tokens in a sentence
topic weights for sentence
, with trainable matrix
and bias vector 
reconstructed sentence embedding as a weighted sum of topic embeddings
trainable matrix of topic embeddings, K=number of topics
Training objective:
where
random sentences are sampled as negative examples from dataset
for each sentence 
average of word embeddings in the i-th sentence
regularizer, that enforces matrix
to be orthogonal
Frobenius norm
Compute topic coherence for at least for 3 different number of topics. Use 10 nearest words for each topic. It means you have to train one model for each number of topics. You can use code from seminar notes with word2vec similarity scores.
-
60 Могу выполнить Ваше задание. Есть опыт работы с нейронными сетями и имплементации их с научных статей на pytorch/tensorflow.
О сроках и цене можем договориться.
-
356 8 0 Добрый день!
У меня имеется большой опыт в DataScience. Есть экспертиза в NLP и разработке моделей на pytorch. Пишите, обсудим детали:)
Current freelance projects in the category Python
Improvement of the macro
16 USD
It is necessary to improve the existing macro. The macro itself might be simple; I don't know because it was passed to me by a former employee. The macro is used to create specifications. Since I work in retail, for each operation with the supplier, specifically for deliveries,… Python ∙ 1 minute back |
Creation of a TikTok farm with income generation
602 USD
Looking for a person who can write software for a TikTok farm, so we can generate traffic and earn income. We are seeking a ready-made solution with a full cycle. Python, Bot Development ∙ 1 day back ∙ 15 proposals |
AI Commenting Platform for TikTok and Instagram.Project Goal Develop a system that allows managing a large number of TikTok and Instagram accounts and automatically posting relevant comments under selected videos using AI. Main Functionality1. Account Management It is necessary to implement the ability to connect accounts:… AI & Machine Learning, Python ∙ 2 days 7 hours back ∙ 22 proposals |
Build a customer classification model1. There is client data in Mongo/SQL (approximately 20,000 entries with raw data). 2. It is necessary to build features and a classification model of clients into behavioral groups based on this data. 3. The project should be completed in Python. AI & Machine Learning, Python ∙ 4 days 1 hour back ∙ 43 proposals |
IT Automation of VAT Reporting
223 USD
It is necessary to develop a system for automating the transfer of sales data from the CRM to the accounting system Wafeq. The system should import bank and payment reports, automatically reconcile payments with invoices, generate invoices for VAT reporting, and minimize manual… AI & Machine Learning, Python ∙ 4 days 7 hours back ∙ 51 proposals |