Data parsing (scraping) from marketplaces
From 7 USD
Data parsing (scraping) from marketplaces: collection of products, prices, descriptions into a Google spreadsheet — one-time export or regular monitoring (hourly/daily/weekly).
Stage 1: Data Preparation
• Analysis of target marketplaces (Prom, Rozetka, HotLine, etc.) — determining the structure of pages, key fields, access restrictions.
• Preparation of a Google spreadsheet template in the required format (sheets for products, attributes, image links, update dates, etc.).
Stage 2: Parser Development
• Development of a script/service (Python/Node.js with Scrapy/Playwright/Requests libraries) for collecting: name, description, price, availability, article number, image links, rating.
• Handling pagination, product variants, filters, and dynamic content (AJAX).
• Setting up error handling, retries, rate limits, headers, proxies, and, if necessary, CAPTCHA bypass.
• Formatting data according to the Google spreadsheet template and writing via Google Sheets API.
What is needed from you:
• A list of marketplaces from which to collect information (e.g., Prom, Rozetka, HotLine, etc.).
• Your or preferred spreadsheet template (column format) — if none, I will prepare a basic template.
• Examples of categories/filters and examples of products (optional — for testing).
• Access to a Google account/API or the ability to provide access to the Google spreadsheet for writing (only if automatic writing is needed).
Data parsing (scraping) from marketplaces: collection of products, prices, descriptions into a Google spreadsheet — one-time export or regular monitoring (hourly/daily/weekly).
Stage 1: Data Preparation
• Analysis of target marketplaces (Prom, Rozetka, HotLine, etc.) — determining the structure of pages, key fields, access restrictions.
• Preparation of a Google spreadsheet template in the required format (sheets for products, attributes, image links, update dates, etc.).
Stage 2: Parser Development
• Development of a script/service (Python/Node.js with Scrapy/Playwright/Requests libraries) for collecting: name, description, price, availability, article number, image links, rating.
• Handling pagination, product variants, filters, and dynamic content (AJAX).
• Setting up error handling, retries, rate limits, headers, proxies, and, if necessary, CAPTCHA bypass.
• Formatting data according to the Google spreadsheet template and writing via Google Sheets API.
What is needed from you:
• A list of marketplaces from which to collect information (e.g., Prom, Rozetka, HotLine, etc.).
• Your or preferred spreadsheet template (column format) — if none, I will prepare a basic template.
• Examples of categories/filters and examples of products (optional — for testing).
• Access to a Google account/API or the ability to provide access to the Google spreadsheet for writing (only if automatic writing is needed).