Data Parsing
-
B2B database from Prom.ua: 5-step pipeline with contact enrichment
Task: collect a B2B database of Prom.ua stores by product category
(sports goods, furniture, auto parts, etc.) with owner contacts
for cold outreach campaigns.
Pipeline (5 steps):
— Step 1: Scrape Prom.ua by category → deduplication
— Step 2: Extract phone numbers (owner's mobile) and websites
— Step 3: EDRPOU enrichment → business type, director's name
— Step 4: Facebook Pixel check on all websites (auto-script)
— Step 5: Export to Google Sheets (gspread API) + Excel
#scraping #prom #b2b #python #edrpou #google_sheets #pipeline
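Step 4 of the pipeline (the Facebook Pixel check) could be approximated as below. This is a minimal sketch, not the script used in the project: the function name `has_facebook_pixel` and the marker list are assumptions, and it only inspects HTML that has already been downloaded. A pixel injected through a tag manager at runtime would not be detected this way.

```python
import re

# Strings that the standard Facebook (Meta) Pixel base snippet normally contains.
# The list is an assumption for illustration; extend it as needed.
PIXEL_MARKERS = (
    re.compile(r"connect\.facebook\.net/\S*fbevents\.js", re.I),  # loader script
    re.compile(r"\bfbq\(\s*['\"]init['\"]", re.I),                # fbq('init', ...)
    re.compile(r"facebook\.com/tr\?", re.I),                      # <noscript> tracking image
)


def has_facebook_pixel(html: str) -> bool:
    """Return True if the page source contains any known Pixel marker."""
    return any(marker.search(html) for marker in PIXEL_MARKERS)
```

The result can be stored as a boolean column next to each website before the export step.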
-
Initial public release of `otomoto_parser`.
Configurable Otomoto.pl car listing scraper with SQLite storage, CLI search, CSV/XLSX export, and Playwright fallback.
-
Job Scraper: Collection of Python Job Vacancies
Automated system that collects Python developer vacancies from employment websites and stores them in an SQLite database.
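Storing vacancies in SQLite without duplicates might look like the sketch below. The table layout, column names, and the helpers `open_db` and `save_vacancies` are hypothetical; the original project's schema is not shown in the description.

```python
import sqlite3


def open_db(path: str = ":memory:") -> sqlite3.Connection:
    """Open the database and create the (assumed) vacancies table if missing."""
    conn = sqlite3.connect(path)
    conn.execute(
        """CREATE TABLE IF NOT EXISTS vacancies (
               url        TEXT PRIMARY KEY,   -- the URL doubles as the dedup key
               title      TEXT NOT NULL,
               company    TEXT,
               first_seen TEXT DEFAULT CURRENT_TIMESTAMP
           )"""
    )
    return conn


def save_vacancies(conn: sqlite3.Connection, rows) -> int:
    """Insert (url, title, company) tuples; already stored URLs are skipped.

    Returns the number of rows that were actually inserted.
    """
    before = conn.total_changes
    with conn:  # commits on success, rolls back on error
        conn.executemany(
            "INSERT OR IGNORE INTO vacancies (url, title, company) VALUES (?, ?, ?)",
            rows,
        )
    return conn.total_changes - before
```

Using the URL as the primary key means repeated scraper runs only add vacancies that have not been seen before.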
-
Data collection from Google Maps
Collects data from Google Maps for any city or region.
Selection options:
- Categories (cafes, pharmacies, service stations, etc.)
- Location (city or region)
The collected data is saved to an XLSX file.
Technologies
#python
#selenium
#pandas
#beautifulsoup4
#Parsing #Google_Maps by the search query "Cafes of Chernihiv" #Parsing_Google_Maps
-
Amazon scraper
The script scrapes data from www.amazon.com. It is written in Python using the Selenium library and stores the results in an SQLite database.
#Python #Selenium #SQLite
-
45 USD Cleaning the database of duplicates
Cleaning the database of duplicates
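The description gives no detail on how duplicates were found, so the following is only one common approach, sketched under assumptions: records are dictionaries, and two records count as duplicates when their normalized `phone` and `email` fields match. The field names and both helper functions are hypothetical.

```python
import re


def normalize_phone(raw: str) -> str:
    """Keep digits only, so differently formatted copies of a number compare equal."""
    return re.sub(r"\D", "", raw or "")


def dedupe(records, key_fields=("phone", "email")):
    """Return records with duplicates removed, keeping the first occurrence."""
    seen = set()
    unique = []
    for rec in records:
        key = tuple(
            normalize_phone(rec.get(field, "")) if field == "phone"
            else (rec.get(field) or "").strip().lower()
            for field in key_fields
        )
        if key in seen:
            continue  # same normalized key was already kept
        seen.add(key)
        unique.append(rec)
    return unique
```

Normalizing before comparing matters more than the comparison itself: most duplicates in scraped contact data differ only in formatting.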
-
100 USD PromUa Automatic Parser
We have written a parser that continuously monitors the PromUA website and extracts data from it.
We are ready to adapt it to your requests and needs.
We collaborate both for one-time projects and with ongoing data updates.
We assist with integration into your software/marketplace/CRM.
We provide data in a format convenient for you (CSV, XLSX, and others).
#parser #Automation #webscraping #saas #sales
#automation
#webscraping
#datascraping
#parser
#scraper
#saas
#b2b
#itservices
#softwaredevelopment
#datacollection
#dataanalysis
#monitoring
#pricemonitoring
#productmonitoring
#stockmonitoring
#pricingstrategy
#marketanalysis
#realtimedata
#integration
#crm
#erp
#api
#marketplace
#ecommerce
#onlinestore
#productcatalog
#products
#sellers
#competitoranalysis
#priceintelligence
#promua
-
100 USD OLX/PromUa Parser
We have written an automatic parser that continuously monitors the OLX and PromUA websites.
We are ready to adapt it to your requests and needs.
We collaborate both for one-time projects and with ongoing data updates.
We assist with integration into your software/marketplace/CRM.
We provide data in a format convenient for you (CSV, XLSX, and others).
#parser #Automation #webscraping #saas #sales
#automation
#webscraping
#datascraping
#datacollection
#datamining
#parser
#scraper
#bots
#saas
#b2b
#b2bservices
#customsoftware
#softwaredevelopment
#itservices
#digitaltransformation
#processautomation
#businessautomation
#leadgeneration
#salesautomation
#dataintegration
#apiintegration
#crm
#marketplace
#etl
#bigdata
#realtimedata
#monitoring
#olxautomation
#olxparser
#olxmonitoring
#realtimeparser
#customparser
#datatocsv
#xlsxexport
#api
#crmbridge
#marketplaceautomation
#priceintelligence
#datadriven
-
33 USD Online store parser (products/prices/stock)
Situation:
The client needs a tool for automatic information gathering from an online store: name, photo, price, description, stock, category.
What I did:
- Wrote a parser that traverses the entire catalog
- Automatic photo download
- Retrieved price, specifications, availability, SKU
- Handled pagination and complex HTML structures
- Bypassed anti-bot protection (timings, headers, dynamic delays)
- Exported data to PostgreSQL and Google Sheets
- Implemented a launch schedule via cron + reports
Result:
- The client automatically updates prices and products without human intervention
- Savings of ~60 hours of work per month
- Ability to import data into their website/CRM
- Stable operation of the parser 24/7
Technologies:
Python, Selenium, BeautifulSoup, lxml, Asyncio, PostgreSQL, GSheets API.
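The pagination handling and the "timings, headers, dynamic delays" mentioned above can be illustrated roughly as follows. This is a sketch under assumptions, not the project's code: `fetch_page` is a hypothetical callable supplied by the caller (for example a wrapper around an HTTP client), the header values are placeholders, and the catalog is assumed to end at the first empty page.

```python
import random
import time

# Placeholder request headers; real values depend on the target site.
HEADERS = {
    "User-Agent": "Mozilla/5.0 (compatible; catalog-sync/1.0)",
    "Accept-Language": "en-US,en;q=0.9",
}


def polite_delay(base=1.0, jitter=0.5, attempt=0, sleep=time.sleep):
    """Sleep base * 2**attempt seconds plus random jitter; return the delay used."""
    delay = base * (2 ** attempt) + random.uniform(0, jitter)
    sleep(delay)
    return delay


def crawl_catalog(fetch_page, max_pages=1000, max_retries=3, sleep=time.sleep):
    """Walk pages 1..N until a page comes back empty.

    fetch_page(page_number, headers) must return a list of product dicts
    and may raise OSError on a network failure or temporary block.
    """
    products = []
    for page in range(1, max_pages + 1):
        for attempt in range(max_retries):
            try:
                items = fetch_page(page, HEADERS)
                break
            except OSError:
                # Back off longer after each failed attempt.
                polite_delay(attempt=attempt, sleep=sleep)
        else:
            raise RuntimeError(f"page {page} failed after {max_retries} attempts")
        if not items:
            break  # an empty page is treated as the end of the catalog
        products.extend(items)
        polite_delay(sleep=sleep)  # pause between successful page loads
    return products
```

Passing `sleep` as a parameter keeps the crawl loop testable without real waiting, and the same loop can be started from cron for scheduled runs.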
-
Tennis matches data parser for Melbet (melbet.com): dataset collection
I developed a synchronous data parser for tennis matches from the betting website Melbet (melbet.com), designed specifically to collect a clean dataset for future machine learning models.
The parser runs in near real time and uses Selenium WebDriver to navigate tennis sections and event pages. It iterates through the required DOM nodes and extracts structured data: tournament, players, start time, markets and odds. The parsing speed is controlled (tunable delays between requests and page transitions) to keep the process stable and avoid overloading the website.
Collected data is cleaned, validated and stored in an MS SQL Server database using a normalized schema (matches, tournaments, markets, odds). On top of that, I implemented CSV export so that the data can be easily used for analytics and for training ML models (e.g. for odds or match outcome prediction).
I designed and implemented the whole solution: database schema, synchronous crawling logic with rate limiting, Selenium error handling, data mapping into SQL tables and the CSV export module.
Tech stack: C#, .NET, Selenium WebDriver, MS SQL Server, ADO.NET / ORM, CSV export, ML-ready dataset preparation.
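The normalized schema and the CSV export described above can be pictured with the sketch below. The real project uses C# and MS SQL Server; here Python's built-in sqlite3 stands in only so that the example is self-contained. The four table names come from the description, while every column name and the `export_csv` helper are assumptions.

```python
import csv
import io
import sqlite3

# Assumed column layout for the four tables named in the description.
SCHEMA = """
CREATE TABLE tournaments (id INTEGER PRIMARY KEY, name TEXT UNIQUE NOT NULL);
CREATE TABLE matches (
    id            INTEGER PRIMARY KEY,
    tournament_id INTEGER NOT NULL REFERENCES tournaments(id),
    player1       TEXT NOT NULL,
    player2       TEXT NOT NULL,
    start_time    TEXT NOT NULL
);
CREATE TABLE markets (id INTEGER PRIMARY KEY, name TEXT UNIQUE NOT NULL);
CREATE TABLE odds (
    match_id     INTEGER NOT NULL REFERENCES matches(id),
    market_id    INTEGER NOT NULL REFERENCES markets(id),
    outcome      TEXT NOT NULL,
    price        REAL NOT NULL,
    collected_at TEXT NOT NULL
);
"""

CSV_HEADER = ["tournament", "player1", "player2", "start_time",
              "market", "outcome", "price", "collected_at"]


def export_csv(conn: sqlite3.Connection) -> str:
    """Join the normalized tables into one flat CSV, one row per odds record."""
    rows = conn.execute(
        """SELECT t.name, m.player1, m.player2, m.start_time,
                  k.name, o.outcome, o.price, o.collected_at
           FROM odds o
           JOIN matches m     ON m.id = o.match_id
           JOIN tournaments t ON t.id = m.tournament_id
           JOIN markets k     ON k.id = o.market_id
           ORDER BY o.collected_at, m.id"""
    )
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(CSV_HEADER)
    writer.writerows(rows)
    return buf.getvalue()
```

Keeping odds in their own table with a `collected_at` timestamp lets the same match be sampled many times, and the flat export is what an ML pipeline would typically consume.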
-
Parsing the maximum product stock on Rozetka
Telegram bot for automatic tracking of product stock on Rozetka with data synchronization in Google Sheets.
What it does:
1. accepts product links and saves them in the database
2. automatically checks availability and updates stock on a schedule
3. synchronizes data with Google Sheets and exports history
4. supports bulk addition of products (up to 20 at a time)
5. allows manual stock checking and table updating
6. stores check history and generates reports
7. can delete and edit products
8. works in automatic and manual modes
9. logs all operations and restarts on errors
The bot is fully asynchronous and error-resistant: it restarts after failures, logs all operations, and updates data on a schedule.
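The scheduled, asynchronous checking and the 20-item bulk limit could be structured along these lines. This is a sketch under assumptions: the in-memory `store` dictionary replaces the bot's database, `fetch_stock` is a hypothetical coroutine supplied by the caller, and the Telegram and Google Sheets parts are left out entirely.

```python
import asyncio

MAX_BULK = 20  # bulk-add limit taken from the feature list above


def add_links(store: dict, links: list) -> list:
    """Add up to MAX_BULK product links; return the ones that were new."""
    if len(links) > MAX_BULK:
        raise ValueError(f"at most {MAX_BULK} links per request")
    accepted = [url for url in dict.fromkeys(links) if url not in store]
    for url in accepted:
        store[url] = None  # stock is unknown until the first check
    return accepted


async def check_all(store: dict, fetch_stock) -> None:
    """Check every tracked product concurrently; a failed check keeps the old value."""
    urls = list(store)
    results = await asyncio.gather(
        *(fetch_stock(url) for url in urls), return_exceptions=True
    )
    for url, result in zip(urls, results):
        if not isinstance(result, BaseException):
            store[url] = result


async def run_schedule(store: dict, fetch_stock, interval: float, rounds=None) -> None:
    """Run check_all every `interval` seconds; rounds=None means run forever."""
    done = 0
    while rounds is None or done < rounds:
        await check_all(store, fetch_stock)
        done += 1
        await asyncio.sleep(interval)
```

Collecting exceptions with `return_exceptions=True` means one failing product page does not abort the whole round, which is one way to get the error resistance the description mentions.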
-
30 USD Web Parsing
Web Scraping: Turn the Internet into Your Database
Open the Doors to Endless Knowledge
Modern business requires up-to-date data. Web scraping is not just about gathering information; it is a strategic tool that allows you to instantly transform millions of web pages into clean, structured, and analysis-ready resources.
What do we do?
We create reliable and fast scrapers that mimic human actions and collect the necessary information from any website, from open data to dynamic, JavaScript-rendered pages.
Price Monitoring: Regular collection of competitor prices, promotional offers, and product availability information.
Content Gathering: Extracting texts, product descriptions, reviews, and metadata for SEO analysis or to fill your platform.
Lead Generation: Collecting contact details and company information for marketing campaigns.
Key Advantages:
Cleanliness and Structure: We deliver data already formatted in convenient formats for you (JSON, CSV, Google Sheets, XML), ready for direct import.
Resilience to Changes: Our scrapers are designed with protection against blocks and automatically adapt to minor changes in the source site's structure.
Automated Mode: We set up regular automatic data updates (hourly, daily, weekly) so your database is always up to date.
File Handling: Attached files (PDFs, images, documents) can be downloaded automatically during scraping.
Our Approach
Source Analysis: Studying the structure of the site and its protection mechanisms.
Development of a Specialized Scraper: Creating a unique tool that works efficiently and discreetly.
Validation and Control: Multi-level quality and completeness checks of the collected information before final delivery.
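A quality and completeness check of the kind listed in the last step might look like this. The rules shown (required fields, a positive numeric price) and the function names are assumptions chosen for illustration; real checks depend on the data being collected.

```python
def validate_records(records, required=("name", "price", "url")):
    """Split records into (valid, rejected); each rejected item carries its reasons."""
    valid, rejected = [], []
    for rec in records:
        problems = [f"missing {field}" for field in required
                    if rec.get(field) in (None, "")]
        price = rec.get("price")
        if price not in (None, ""):
            try:
                if float(price) <= 0:
                    problems.append("non-positive price")
            except (TypeError, ValueError):
                problems.append("price is not a number")
        if problems:
            rejected.append((rec, problems))
        else:
            valid.append(rec)
    return valid, rejected


def completeness(records, field):
    """Share of records with a non-empty value for `field` (0.0 for an empty list)."""
    if not records:
        return 0.0
    filled = sum(1 for rec in records if rec.get(field) not in (None, ""))
    return filled / len(records)
```

Rejected records keep their list of reasons, so they can be reported back or re-scraped instead of being silently dropped.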
Gain a competitive advantage based on facts, not assumptions.