We need a developer to create a parser that will automatically collect data from Instagram Reels and TikTok.
The parser should only work with accounts from a list (Google Sheets) and select relevant videos based on clear criteria.
Focus:
Work only with micro-influencers (5,000 – 15,000 followers).
The initial account selection is done manually; the parser then works only from this list.
Only videos with more than 10,000 views are collected.
Posts no older than 45 days.
ER (engagement rate) of at least 2%.
Filtering out videos with unwanted topics (casino, 18+, politics, etc.).
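
For illustration, the criteria above could be combined into a single check along the following lines. This is only a sketch: the brief does not define how ER is calculated, so the formula here (likes plus comments, divided by views) is an assumption, and ER by followers is an equally common choice. The `Video` class, the function names, and the keyword list are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

# Thresholds taken from the brief.
MIN_FOLLOWERS, MAX_FOLLOWERS = 5_000, 15_000
MIN_VIEWS = 10_000            # "more than 10,000 views" (strictly greater)
MAX_AGE_DAYS = 45
MIN_ER_PERCENT = 2.0
# Assumption: illustrative blocklist; the real one would be agreed separately.
BLOCKED_KEYWORDS = ("casino", "18+", "politics")


@dataclass
class Video:
    url: str
    published_at: datetime    # timezone-aware
    views: int
    likes: int
    comments: int
    description: str


def engagement_rate(video):
    """ER% by views (assumed formula): (likes + comments) * 100 / views."""
    if video.views == 0:
        return 0.0
    return (video.likes + video.comments) * 100 / video.views


def is_relevant(video, followers, now=None):
    """Return True only if the video passes every criterion in the brief."""
    now = now or datetime.now(timezone.utc)
    text = video.description.lower()
    return (
        MIN_FOLLOWERS <= followers <= MAX_FOLLOWERS
        and video.views > MIN_VIEWS
        and now - video.published_at <= timedelta(days=MAX_AGE_DAYS)
        and engagement_rate(video) >= MIN_ER_PERCENT
        and not any(word in text for word in BLOCKED_KEYWORDS)
    )
```

Plain substring matching is the simplest possible topic filter and will miss misspellings and non-text content; it is shown only to make the criterion concrete.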
Data to collect:
- For videos: URL, publication date, views (>10k), likes, comments, reposts/saves (if available), ER%, hashtags/description, language (auto-detected), geo (if available).
- For accounts: number of followers, average ER for the last 10 videos, average views, posting frequency, contacts (email, link in bio).
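
The account-level averages could be derived from the per-video data roughly as follows. This is a minimal sketch under assumptions: ER uses the same by-views formula as a hypothetical choice (the brief does not fix one), posting frequency is expressed as posts per week over the span of the sampled videos, and the function name and dict keys are illustrative.

```python
from statistics import mean


def account_summary(videos, last_n=10):
    """Summarise an account from its most recent videos.

    videos: list of dicts with 'published_at' (datetime), 'views',
    'likes' and 'comments'. Only the newest `last_n` are used.
    """
    recent = sorted(videos, key=lambda v: v["published_at"], reverse=True)[:last_n]
    if not recent:
        return {"avg_er": 0.0, "avg_views": 0.0, "posts_per_week": 0.0}

    # Assumed ER formula: (likes + comments) * 100 / views, skipping zero-view videos.
    ers = [(v["likes"] + v["comments"]) * 100 / v["views"]
           for v in recent if v["views"] > 0]

    # Posting frequency: intervals between posts divided by the time span covered.
    span_days = (recent[0]["published_at"] - recent[-1]["published_at"]).total_seconds() / 86400
    posts_per_week = (len(recent) - 1) * 7 / span_days if span_days > 0 else 0.0

    return {
        "avg_er": mean(ers) if ers else 0.0,
        "avg_views": mean(v["views"] for v in recent),
        "posts_per_week": posts_per_week,
    }
```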
Export format:
All data is stored in a Google Sheets table with the following columns:
| Platform | Account | Video URL | Followers | Views | Likes | Comments | ER% | Average Account ER | Publication Date | Topic/Hashtags | Contacts |
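
As a sketch of how collected records might be lined up with the column layout above, the snippet below converts a dict into an ordered row. The `to_row` helper and the dict-keyed-by-column-name convention are assumptions for illustration; the actual write to Google Sheets would go through whichever Sheets API client the developer chooses and is not shown here.

```python
from datetime import date

# Column order copied from the table header in the brief.
COLUMNS = [
    "Platform", "Account", "Video URL", "Followers", "Views", "Likes",
    "Comments", "ER%", "Average Account ER", "Publication Date",
    "Topic/Hashtags", "Contacts",
]


def to_row(record):
    """Turn a dict keyed by column name into a list ordered like COLUMNS."""
    row = []
    for column in COLUMNS:
        value = record.get(column, "")   # unknown fields are left blank
        if isinstance(value, date):
            value = value.isoformat()    # dates become ISO strings
        elif isinstance(value, float):
            value = round(value, 2)      # percentages to two decimals
        row.append(value)
    return row
```

Keeping the column order in one constant means the header row and every data row are built from the same source, so they cannot drift apart.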
Working mode:
Requirements for the contractor:
Experience with Python (Selenium, Playwright, Scrapy, BeautifulSoup).
Ability to work with APIs (Instagram, TikTok).
Experience with Google Sheets API.
Experience in collecting and filtering large datasets.
Nice to have:
Experience with anti-blocking techniques (proxies, CAPTCHA handling, browser fingerprinting).
Experience calculating engagement metrics (ER, averages).
Format: remote work, per-project payment, with the possibility of long-term cooperation (support and updates).
We look forward to your feedback.