OLX parser with export to Excel (Python, BeautifulSoup)
Python script for automatic collection of ads from the OLX website with subsequent export to a structured Excel file.
Functionality:
Parsing ads from multiple pages
Data collection:
product name
price
city
publication date
link to the ad
Duplicate filtering
Bypassing blocks (headers + delays between requests)
Export to Excel with formatting:
stylized headers
auto column width
row color alternation
frozen top panel
Separate sheet with summarized statistics:
number of ads
minimum / maximum / average price
parsing date
Technologies:
#Python #BeautifulSoup #Requests #OpenPyXL #WebScraping #DataParsing
Functionality:
Parsing ads from multiple pages
Data collection:
product name
price
city
publication date
link to the ad
Duplicate filtering
Bypassing blocks (headers + delays between requests)
Export to Excel with formatting:
stylized headers
auto column width
row color alternation
frozen top panel
Separate sheet with summarized statistics:
number of ads
minimum / maximum / average price
parsing date
Technologies:
#Python #BeautifulSoup #Requests #OpenPyXL #WebScraping #DataParsing