Topic modeling
51 USD
Use data from https://github.com/thedenaas/hse_seminars/tree/master/2018/seminar_13/data.zip
Implement the model in PyTorch from "An Unsupervised Neural Attention Model for Aspect Extraction" (He et al., 2017), also described in https://github.com/thedenaas/hse_seminars/blob/master/2019/seminar_13/topic.ipynb .
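You can use sentence embeddings with attention:
$z_s = \sum_{i=1}^{n} a_i e_{w_i}$ - sentence embedding
$a_i = \frac{\exp(d_i)}{\sum_{j=1}^{n} \exp(d_j)}$ - attention weight for i-th token
$d_i = e_{w_i}^\top M y_s$ - attention with trainable matrix $M \in \mathbb{R}^{d \times d}$
$y_s = \frac{1}{n} \sum_{i=1}^{n} e_{w_i}$ - sentence context
$e_{w_i} \in \mathbb{R}^{d}$, token embedding of size d
$n$ - number of tokens in a sentence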
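The attention variant can be sketched as a small PyTorch module. This is only an illustration under assumptions: it follows the formulation in the cited paper (weights a_i are a softmax over scores e_{w_i}^T M y_s, where y_s is the average token embedding), and the class name, the batch layout, and the padding mask are hypothetical choices, not taken from the seminar notebook.

```python
import torch
import torch.nn as nn


class AttentionSentenceEmbedding(nn.Module):
    """z_s = sum_i a_i * e_{w_i}, with a_i = softmax_i(e_{w_i}^T M y_s)."""

    def __init__(self, emb_dim: int):
        super().__init__()
        # Trainable d x d attention matrix M, stored as a bias-free linear map.
        self.M = nn.Linear(emb_dim, emb_dim, bias=False)

    def forward(self, emb: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
        # emb: (batch, n, d) token embeddings.
        # mask: (batch, n), 1 for real tokens and 0 for padding (assumed layout).
        # Every sentence is assumed to contain at least one real token.
        mask = mask.float()
        lengths = mask.sum(dim=1, keepdim=True).clamp(min=1.0)
        # y_s: average of the real token embeddings, shape (batch, d).
        y = (emb * mask.unsqueeze(-1)).sum(dim=1) / lengths
        # d_i = e_{w_i}^T M y_s for every token, shape (batch, n).
        scores = torch.bmm(emb, self.M(y).unsqueeze(-1)).squeeze(-1)
        # Padding positions get -inf so that softmax gives them zero weight.
        scores = scores.masked_fill(mask == 0, float("-inf"))
        a = torch.softmax(scores, dim=1)
        # z_s: attention-weighted sum of token embeddings, shape (batch, d).
        return torch.bmm(a.unsqueeze(1), emb).squeeze(1)
```

Calling `AttentionSentenceEmbedding(d)(emb, mask)` returns one d-dimensional vector per sentence; padded positions do not influence the result.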
Or just use sentence embedding as an average over word embeddings:
$z_s = \frac{1}{n} \sum_{i=1}^{n} e_{w_i}$ - sentence embedding
$e_{w_i} \in \mathbb{R}^{d}$, token embedding of size d
$n$ - number of tokens in a sentence
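A minimal sketch of the averaging variant, starting from padded token ids. The padding index `PAD_ID = 0` and the function name are assumptions made for the example; the point is that padding tokens must be excluded from the average.

```python
import torch
import torch.nn as nn

PAD_ID = 0  # assumed id of the padding token


def mean_sentence_embedding(embedding: nn.Embedding,
                            token_ids: torch.Tensor) -> torch.Tensor:
    """Average the embeddings of the non-padding tokens of each sentence."""
    # token_ids: (batch, n) integer ids, padded with PAD_ID.
    mask = (token_ids != PAD_ID).float()                    # (batch, n)
    emb = embedding(token_ids)                              # (batch, n, d)
    lengths = mask.sum(dim=1, keepdim=True).clamp(min=1.0)  # avoid division by 0
    return (emb * mask.unsqueeze(-1)).sum(dim=1) / lengths  # (batch, d)
```

For a batch such as `torch.tensor([[1, 2, 0], [3, 3, 3]])`, the first row is averaged over two tokens and the second over three.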
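$p_t = \mathrm{softmax}(W z_s + b)$ - topic weights for a sentence with embedding $z_s$, with trainable matrix $W \in \mathbb{R}^{K \times d}$ and bias vector $b \in \mathbb{R}^{K}$
$r_s = T^\top p_t$ - reconstructed sentence embedding as a weighted sum of topic embeddings
$T \in \mathbb{R}^{K \times d}$ - trainable matrix of topic embeddings, K = number of topics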
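The topic-weight and reconstruction steps could look like the sketch below, assuming the formulation of the cited paper: topic weights are a softmax over a linear map of the sentence embedding, and the reconstruction is the corresponding weighted sum of the rows of the topic matrix. The class name and the random initialization of the topic matrix are assumptions made for the example.

```python
import torch
import torch.nn as nn


class TopicReconstruction(nn.Module):
    """p = softmax(W z + b), r = T^T p."""

    def __init__(self, emb_dim: int, n_topics: int):
        super().__init__()
        # Trainable matrix W (K x d) and bias vector b (K).
        self.W = nn.Linear(emb_dim, n_topics)
        # Trainable K x d matrix of topic embeddings.
        # Small random init is an assumption made for this sketch.
        self.T = nn.Parameter(torch.randn(n_topics, emb_dim) * 0.1)

    def forward(self, z: torch.Tensor):
        # z: (batch, d) sentence embeddings.
        p = torch.softmax(self.W(z), dim=-1)  # (batch, K) topic weights
        r = p @ self.T                        # (batch, d) reconstruction
        return p, r
```

Each row of `p` sums to 1, so `r` is a convex combination of the topic embeddings.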
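Training objective:
$L(\theta) = J(\theta) + \lambda U(\theta)$
$J(\theta) = \sum_{s \in D} \sum_{i=1}^{m} \max(0, 1 - r_s^\top z_s + r_s^\top n_i)$
where
$z_s$, $r_s$ - sentence embedding and its reconstruction
$m$ random sentences are sampled as negative examples from dataset $D$ for each sentence $s$
$n_i$ - average of word embeddings in the i-th negative sentence
$U(\theta) = \lVert T_n T_n^\top - I \rVert_F$ - regularizer that enforces matrix $T$ to be orthogonal ($T_n$ is $T$ with rows normalized to unit length)
$\lVert \cdot \rVert_F$ - Frobenius norm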
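A sketch of the objective, assuming the max-margin form from the cited paper: a hinge loss max(0, 1 - r·z + r·n_i) summed over the negative samples, plus the Frobenius norm of T_n T_n^T - I, where T_n is the topic matrix with unit-length rows. The function names and the default weight `lam=1.0` are assumptions for the example.

```python
import torch
import torch.nn.functional as F


def max_margin_loss(r: torch.Tensor, z: torch.Tensor,
                    neg: torch.Tensor) -> torch.Tensor:
    """Hinge loss summed over sentences and their negative samples."""
    # r, z: (batch, d); neg: (batch, m, d) averaged embeddings of m negatives.
    pos = (r * z).sum(dim=-1, keepdim=True)                  # (batch, 1)
    neg_sim = torch.bmm(neg, r.unsqueeze(-1)).squeeze(-1)    # (batch, m)
    return torch.clamp(1.0 - pos + neg_sim, min=0.0).sum()


def orthogonality_penalty(T: torch.Tensor) -> torch.Tensor:
    """Frobenius norm of T_n T_n^T - I, with T_n the row-normalized T."""
    T_n = F.normalize(T, dim=1)
    eye = torch.eye(T.size(0), device=T.device, dtype=T.dtype)
    return torch.linalg.matrix_norm(T_n @ T_n.t() - eye, ord="fro")


def total_loss(r, z, neg, T, lam: float = 1.0) -> torch.Tensor:
    # lam weights the regularizer; 1.0 is an assumed default.
    return max_margin_loss(r, z, neg) + lam * orthogonality_penalty(T)
```

The penalty is zero when the topic embeddings are mutually orthogonal and grows as topics start to overlap.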
Compute topic coherence for at least 3 different numbers of topics. Use the 10 nearest words for each topic. This means you have to train one model for each number of topics. You can use the code from the seminar notes with word2vec similarity scores.
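One possible way to score a topic is sketched below in plain Python. It assumes coherence is the mean pairwise cosine similarity between the topic's nearest words, which may differ from the metric used in the seminar notes. `word_vectors` is a hypothetical dict from word to vector (for example, exported from a word2vec model), and `topic_vec` is one row of the trained topic matrix.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 for zero vectors)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def nearest_words(topic_vec, word_vectors, top_n=10):
    """Words whose vectors are closest (by cosine) to the topic embedding."""
    ranked = sorted(word_vectors,
                    key=lambda w: cosine(topic_vec, word_vectors[w]),
                    reverse=True)
    return ranked[:top_n]


def topic_coherence(words, word_vectors):
    """Mean pairwise cosine similarity between the given words."""
    pairs = list(combinations(words, 2))
    if not pairs:
        return 0.0
    total = sum(cosine(word_vectors[a], word_vectors[b]) for a, b in pairs)
    return total / len(pairs)
```

Averaging `topic_coherence(nearest_words(t, word_vectors), word_vectors)` over all topic rows `t` gives one score per trained model, which can then be compared across the different numbers of topics.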
-
I can complete your task. I have experience with neural networks and with implementing them from research papers in pytorch/tensorflow.
The deadline and price are negotiable.
-
Hello!
I have extensive experience in data science, with expertise in NLP and in developing models in pytorch. Message me and we can discuss the details :)