Prom parser
Project: PromParser
Functionality: The script accepts a link to the seller's catalog and automatically goes through all pagination pages (from the first to the last), collecting complete information about each product item.
Stages of the script's work:
1. Link collection (Playwright + Proxy): Browser automation simulates the actions of a real user, navigating through the catalog pages and collecting links to all product cards. The use of proxies helps avoid blocks.
2. Asynchronous loading (Aiohttp + Proxy): The script instantly makes parallel requests to each collected product card. This ensures maximum speed of operation compared to regular browser parsers.
3. Data structuring (BeautifulSoup4): Extraction from the HTML code of specific characteristics: name, direct link, price, and a critically important metric — number of sales.
4. Export to report (Excel / Openpyxl): Data is stored in an Excel spreadsheet with professional formatting: auto-adjusting column widths, bold headers, and active links for the client's convenience.
Functionality: The script accepts a link to the seller's catalog and automatically goes through all pagination pages (from the first to the last), collecting complete information about each product item.
Stages of the script's work:
1. Link collection (Playwright + Proxy): Browser automation simulates the actions of a real user, navigating through the catalog pages and collecting links to all product cards. The use of proxies helps avoid blocks.
2. Asynchronous loading (Aiohttp + Proxy): The script instantly makes parallel requests to each collected product card. This ensures maximum speed of operation compared to regular browser parsers.
3. Data structuring (BeautifulSoup4): Extraction from the HTML code of specific characteristics: name, direct link, price, and a critically important metric — number of sales.
4. Export to report (Excel / Openpyxl): Data is stored in an Excel spreadsheet with professional formatting: auto-adjusting column widths, bold headers, and active links for the client's convenience.