Switch to English?
Yes
Переключитись на українську?
Так
Переключиться на русскую?
Да
Przełączyć się na polską?
Tak
Project description:
Developed a scalable parser for ads from Auto.ria (automotive marketplace) to collect detailed data about cars: make/model, year, mileage, price, configuration, photo links, seller contact information, and other metadata. The parser is designed considering the real limitations of the platforms: it uses proxy rotation, User-Agent switching, parallelism control, and protection against anti-bot mechanisms — all of this allows for stable collection of large volumes of data with minimal risk of blocks.

Functionality:

Mass collection of ads (by categories, filters, price ranges, regions).

Collection of a complete set of fields: title, description, specifications, price, location, photos/gallery, contact information, publication date.

Support for pagination, dynamic loading, and AJAX parts of pages.

Proxy rotation (residential/datacenter), balancing by IP and geography.

Dynamic changing of User-Agent and other HTTP headers.

Semaphores and throttling — control of parallelism to avoid overloading the platform.

Captcha handling (integration with solving services if necessary) and respectful backoff strategies in case of errors.

Deduplication of records (by unique ID or URL), incremental updates, and (optionally) tracking changes in ads.

Saving in convenient formats: PostgreSQL/SQLite, CSV, Excel; export for analytics.

Logging, metrics, and monitoring (number of collected ads, errors, health-check).
Work details
Budget 56 USD
Added 12 November 2025
137 views
Freelancer
Danilo Kanivets
Ukraine Kharkiv  50  1

Available for hire Available for hire
51 Safes completed
On the service 11 months 1 day