Switch to English?
Yes
Переключитись на українську?
Так
Переключиться на русскую?
Да
Przełączyć się na polską?
Tak

Data collection from the Slovak Ministry of Justice register

Development of a Python script for automating the collection of data from the Commercial Register of the Slovak Ministry of Justice.

The script uses:

requests for fetching web pages,
BeautifulSoup for parsing HTML,
ThreadPoolExecutor for multithreading and speeding up the process,
xlsxwriter and openpyxl for saving data in Excel format.

Key Tasks:

Overcome the website’s limitation on the number of records returned per query.
Implement an iterative and optimized data scraping process.

Results:

Successfully collected and processed over 300,000 records.
The solution demonstrated high scalability and reliability.
Data was prepared in a format convenient for analysis.
Work details
Budget 344 USD
Added 10 December 2024
296 views
Freelancer
Yaroslav Dmytriiev
Ukraine Kyiv  1  0

Available for hire Available for hire
1 Safe completed
On the service 1 year