Product photo parser from an online store
Created a multithreaded utility for mass collection and optimization of product photos from an online store.
Main features:
— Automatic crawling of the site's XML sitemap
— Parallel downloading through a thread pool
— Conversion of images to JPEG while preserving quality
— Fault tolerance during server errors and timeouts
— Graphical interface with a progress bar and live log
Technical part:
— Python 3
— requests, BeautifulSoup4, lxml
— Pillow / PIL
— concurrent.futures, Tkinter
The project was implemented "turnkey."
Main features:
— Automatic crawling of the site's XML sitemap
— Parallel downloading through a thread pool
— Conversion of images to JPEG while preserving quality
— Fault tolerance during server errors and timeouts
— Graphical interface with a progress bar and live log
Technical part:
— Python 3
— requests, BeautifulSoup4, lxml
— Pillow / PIL
— concurrent.futures, Tkinter
The project was implemented "turnkey."