Technical specification: parser for sellers/products with filters and multi-launch capability
Need a specialist in web scraping and anti-bot protections.
The site actively blocks proxies and scrapers (constant CAPTCHA / redirect to anti-bot pages).
It is necessary to implement a stable multithreaded parsing with bypassing DataDome / Cloudflare, so that the site does not return errors (403 / 429 / captcha-loop).
Main requirements
1) Multithreading / Asynchronous
stable operation of 40+ parallel threads (negotiable)
request rate control, queues, backoff
without performance degradation
2) Proxies
support for HTTP / SOCKS
automatic rotation on CAPTCHA, bans, errors
ability to use a separate proxy for each thread
consideration of IP reputation
3) Bypassing anti-bot
emulation of a real browser (Playwright / Selenium headless or similar)
realistic headers, user-agent, cookies, delays
obtaining and maintaining a valid session
automatic CAPTCHA solving (2Captcha / CapSolver or similar)
4) Resilience
handling blocks and redirects
auto-recovery of sessions
logging errors and reasons for blocks
Technologies
Language: Python (preferred)
Result
working code / module
launch instructions and recommendations on limits / proxies
confirmation of stable operation under protection
details in private messages
-
хоть бы сайт указали...
-
Скажіть сайт, тоді поговоримо
-
Напишіть посилання на сайт
-
Current freelance projects in the category Data Parsing
Collection of B2B database of companies in Germany
40 USD
Goal: To obtain a list of potential employers (clients) for B2B mailing. Region: Munich (München) + radius of 50 km. Required niches: Construction companies (Bauunternehmen) Food enterprises (Lebensmittelhersteller, meat processing plants, bakeries) Logistics and… Data Parsing, Lead Generation & Sales ∙ 1 hour 38 minutes back ∙ 12 proposals |
Carrier databaseInterested in compiling a database of carriers in Ukraine for the year 2026, including tankers, tarpaulins, grain carriers, and others. It is preferable to develop a table. Data Parsing ∙ 2 hours 55 minutes back ∙ 19 proposals |
Consultation on parsing Instagram account subscribersHello. It is necessary to conduct a preliminary assessment of the feasibility of the following task. I have a list of Instagram accounts. The goal is to obtain contact information (primarily email addresses) of users who follow these accounts. Previously, I encountered companies… Data Parsing ∙ 3 days 18 hours back ∙ 12 proposals |
A specialist is needed to find contacts of decision-makers in Ukraine.It is necessary to gather a database (or ready database) of contacts of decision-makers (DMs) in companies in Ukraine. Information Gathering, Data Parsing ∙ 3 days 22 hours back ∙ 17 proposals |
Need to scrape data from LinkedInWe need to scrape data from LinkedIn based on our list. For each entry, we need to find and collect available data if it exists on the LinkedIn profile, including the profile picture on the LinkedIn social network, email address, links to social media, company website, and… Data Parsing ∙ 4 days 4 hours back ∙ 28 proposals |