Development of a solution for parsing/collecting data from API/websites and gathering contacts for outbound/sales.
Develop a solution (script/service/Make.com scenario) that:
regularly retrieves data from specified sources (API and/or websites),
performs website parsing,
finds new entities that are not in our internal list,
applies configured filters,
finds the official website of the entity,
collects contacts of people in relevant roles for launching outbound/sales campaigns,
stores the result in a database/storage + provides export.
Expected Result
A ready solution that creates and maintains a contact database in the structure:
Project website
Email
LinkedIn profile link
Job title
Twitter/X profile link
Phone number
Telegram handle or Telegram link
Additionally (preferably): date found, source, and validation/completeness status of the contact.
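For illustration, the record structure above could be modelled roughly as follows. This is a minimal sketch: the class name, field names, the default status value, and the completeness measure are assumptions, not part of the specification.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class Contact:
    # Required fields from the structure above (names are hypothetical).
    project_website: str
    email: Optional[str] = None
    linkedin_url: Optional[str] = None
    job_title: Optional[str] = None
    twitter_url: Optional[str] = None
    phone: Optional[str] = None
    telegram: Optional[str] = None
    # Optional extras from the "Additionally (preferably)" line.
    found_on: Optional[date] = None
    source: Optional[str] = None
    status: str = "unverified"  # assumed default, not from the spec

    def completeness(self) -> float:
        """Share of the six per-person fields that are filled in."""
        values = [self.email, self.linkedin_url, self.job_title,
                  self.twitter_url, self.phone, self.telegram]
        return sum(1 for v in values if v) / len(values)
```

A completeness score like this is one possible way to fill the "validation/completeness status" field; the actual rule is to be agreed.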
Data Sources and Processing Streams (3 streams)
Stream 1: Source of the “new entries catalog” type (API/web)
Functions:
Retrieve entries for a period and/or incrementally (new since the last run).
Identify “new” entities relative to our internal list (CSV/Google Sheet/JSON).
Apply filtering (configured).
For each entity:
find the official website,
collect contacts for the required roles,
record in the database/storage.
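The "identify new entities" step could look roughly like the sketch below. It assumes the internal list is a CSV with a `website` column and that entities are matched by domain; both the column name and the matching rule are assumptions to be confirmed.

```python
import csv
import io
from urllib.parse import urlparse


def domain_key(url_or_domain: str) -> str:
    """Reduce a URL or bare domain to a lowercase host without 'www.'."""
    value = url_or_domain.strip().lower()
    if "://" not in value:
        value = "http://" + value  # let urlparse see a network location
    host = urlparse(value).hostname or ""
    return host[4:] if host.startswith("www.") else host


def load_internal_domains(csv_text: str, column: str = "website") -> set[str]:
    """Read the internal exclusion list (hypothetical 'website' column)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {domain_key(row[column]) for row in reader if row.get(column)}


def find_new(entries: list[dict], known: set[str]) -> list[dict]:
    """Keep only entries whose domain is not in the internal list."""
    return [e for e in entries if domain_key(e["website"]) not in known]
```

Matching on the normalized domain rather than the raw URL means `https://www.acme.io/` and `acme.io` are treated as the same entity.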
Stream 2: Aggregators/directories (multiple independent sources)
Functions:
Retrieve data from multiple aggregators/directories (via API or scraping).
Merge/normalize entries, perform deduplication.
Identify “new” entities (not present in our internal list).
Apply filtering (configured).
Find the official website and contacts for the required roles.
Store the result.
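One possible shape for the merge/normalize/deduplicate step is sketched below. The input layout (a dict of source name to entries with `name` and `website` keys) is an assumption, as is the choice to key records by domain with a fallback to the name.

```python
from urllib.parse import urlparse


def host_of(url: str) -> str:
    """Lowercase host without 'www.', or '' when no website is given."""
    url = url.strip().lower()
    if not url:
        return ""
    if "://" not in url:
        url = "http://" + url
    host = urlparse(url).hostname or ""
    return host.removeprefix("www.")


def merge_sources(batches: dict[str, list[dict]]) -> list[dict]:
    """Merge entries from several directories into one record per entity.

    Records are keyed by domain, falling back to the case-folded name.
    Limitation of this sketch: an entry known only by name in one source
    and only by domain in another will not be merged.
    """
    merged: dict[str, dict] = {}
    for source, entries in batches.items():
        for entry in entries:
            name = " ".join(entry.get("name", "").split())  # collapse spaces
            domain = host_of(entry.get("website", ""))
            key = domain or name.casefold()
            if not key:
                continue  # nothing to match on
            record = merged.setdefault(
                key, {"name": name, "domain": domain, "sources": []})
            if source not in record["sources"]:
                record["sources"].append(source)
    return list(merged.values())
```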
Stream 3: Comparing presence between sources
Functions:
Parse/retrieve data from 3–5 platforms/directories (API/web).
Compare the “presence” of the entity between sources (matching by name/domain/identifier).
Identify entities that:
are not in our internal list,
meet the filters.
For relevant entities: extract contacts and save them.
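The presence comparison could be sketched as below, assuming each platform has already been reduced to a set of normalized entity keys (for example domains). Which presence pattern counts as relevant is not stated above, so the `target` parameter ("seen elsewhere but absent from this platform") is purely a hypothetical rule.

```python
def presence(sources: dict[str, set[str]]) -> dict[str, list[str]]:
    """Map each entity key to the sorted list of sources that contain it."""
    result: dict[str, list[str]] = {}
    for source_name, keys in sources.items():
        for key in keys:
            result.setdefault(key, []).append(source_name)
    return {key: sorted(names) for key, names in sorted(result.items())}


def missing_from(sources: dict[str, set[str]], target: str,
                 internal: set[str]) -> list[str]:
    """Entities seen in some source but absent from `target`
    and from the internal list (assumed selection rule)."""
    seen = presence(sources)
    return [key for key, names in seen.items()
            if target not in names and key not in internal]
```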
Contact Collection Logic
For each selected entity:
Determine the official website (from the source data or by following additional links and running checks).
Collect contacts for roles (configured list of roles), for example:
Founder / C-level
Head/Lead of Partnerships / BD / Sales / Growth
Head/Lead of Operations / Product (and others, to be agreed)
Contact sources:
“Team”, “About”, “Contact”, “Careers”, “Press” pages
LinkedIn (links to profiles/company, if available)
Twitter/X, Telegram, email/forms, phones
Minimum requirement: store contacts only in the specified field structure.
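As a rough illustration of pulling contact channels out of a fetched "Team" or "Contact" page, the sketch below uses only the Python standard library. The URL patterns and the email regex are simplifying assumptions; matching a contact to a role or job title is not covered here.

```python
import re
from html.parser import HTMLParser

# Deliberately simple pattern; real-world email validation is stricter.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


class ContactLinkParser(HTMLParser):
    """Collect emails, phones and social profile links from one page."""

    def __init__(self):
        super().__init__()
        self.found = {"email": set(), "linkedin": set(), "twitter": set(),
                      "telegram": set(), "phone": set()}

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href") or ""
        low = href.lower()
        if low.startswith("mailto:"):
            self.found["email"].add(href[7:].split("?")[0])
        elif low.startswith("tel:"):
            self.found["phone"].add(href[4:])
        elif "linkedin.com/" in low:
            self.found["linkedin"].add(href)
        elif "twitter.com/" in low or "//x.com/" in low:
            self.found["twitter"].add(href)
        elif "//t.me/" in low:
            self.found["telegram"].add(href)

    def handle_data(self, data):
        # Plain-text emails that are not wrapped in a mailto: link.
        self.found["email"].update(EMAIL_RE.findall(data))


def extract_contacts(html: str) -> dict[str, list[str]]:
    parser = ContactLinkParser()
    parser.feed(html)
    return {kind: sorted(values) for kind, values in parser.found.items()}
```

Pages rendered by JavaScript would first need to be loaded in a browser tool such as Playwright before their HTML is passed to `extract_contacts`.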
Data, Storage, Export Requirements
Storage: PostgreSQL or SQLite (the contractor proposes the optimal option) or storage with export to Google Sheets.
Deduplication:
by domain/website
by email
by LinkedIn URL
by Twitter/X URL
by phone/Telegram
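Contact-level deduplication on these keys might be sketched as follows. The dict field names and the normalization rules (lowercased emails, digits-only phones, case-insensitive profile paths) are assumptions; deduplication by domain is left out here on the assumption that it applies to entities rather than to individual people.

```python
import re
from urllib.parse import urlparse


def norm_profile(url: str) -> str:
    """Host plus path, lowercased, without scheme, 'www.' or trailing slash."""
    url = url.strip().lower()
    if "://" not in url:
        url = "https://" + url
    parts = urlparse(url)
    host = (parts.hostname or "").removeprefix("www.")
    return host + parts.path.rstrip("/")


def norm_telegram(value: str) -> str:
    """Reduce '@handle' or 'https://t.me/handle' to the bare handle."""
    return value.strip().lower().rstrip("/").split("/")[-1].lstrip("@")


def dedup_keys(contact: dict) -> set[tuple[str, str]]:
    keys = set()
    if contact.get("email"):
        keys.add(("email", contact["email"].strip().lower()))
    for field in ("linkedin", "twitter"):
        if contact.get(field):
            keys.add((field, norm_profile(contact[field])))
    if contact.get("phone"):
        keys.add(("phone", re.sub(r"\D", "", contact["phone"])))
    if contact.get("telegram"):
        keys.add(("telegram", norm_telegram(contact["telegram"])))
    return keys


def deduplicate(contacts: list[dict]) -> list[dict]:
    """Keep the first contact for every key; drop later ones sharing any key."""
    seen: set[tuple[str, str]] = set()
    unique = []
    for contact in contacts:
        keys = dedup_keys(contact)
        if keys & seen:
            continue
        seen |= keys
        unique.append(contact)
    return unique
```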
Export: at least 1 option from (CSV / Google Sheets / Airtable), preferably 2.
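A minimal storage-and-export sketch with SQLite and CSV is shown below. The table layout, column names, the `UNIQUE` constraints, and the default status are assumptions; a PostgreSQL variant would follow the same idea with a different driver.

```python
import csv
import io
import sqlite3

COLUMNS = ["project_website", "email", "linkedin_url", "job_title",
           "twitter_url", "phone", "telegram"]

# Hypothetical schema; UNIQUE columns let the database reject duplicates.
SCHEMA = """
CREATE TABLE IF NOT EXISTS contacts (
    id INTEGER PRIMARY KEY,
    project_website TEXT NOT NULL,
    email TEXT UNIQUE,
    linkedin_url TEXT UNIQUE,
    job_title TEXT,
    twitter_url TEXT,
    phone TEXT,
    telegram TEXT,
    status TEXT NOT NULL DEFAULT 'new'
)
"""


def save(conn: sqlite3.Connection, contact: dict) -> bool:
    """Insert a contact; return False if a UNIQUE column already has it."""
    row = {**dict.fromkeys(COLUMNS), **contact}  # missing fields become NULL
    placeholders = ", ".join(":" + c for c in COLUMNS)
    cur = conn.execute(
        f"INSERT OR IGNORE INTO contacts ({', '.join(COLUMNS)}) "
        f"VALUES ({placeholders})", row)
    conn.commit()
    return cur.rowcount == 1


def export_csv(conn: sqlite3.Connection) -> str:
    """Return all contacts as CSV text with a header row."""
    cur = conn.execute(
        f"SELECT {', '.join(COLUMNS)}, status FROM contacts ORDER BY id")
    out = io.StringIO()
    writer = csv.writer(out)
    writer.writerow([col[0] for col in cur.description])
    writer.writerows(cur)
    return out.getvalue()
```

The `status` column is one place where a "processed / already contacted" flag could live.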
Technical requirements:
logs,
error handling,
retry/backoff,
rate limiting.
Preferably:
status “processed / already contacted” for entries.
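The logging, retry/backoff, and rate-limiting requirements could be covered by small helpers along these lines. This is a sketch with assumed defaults (four attempts, exponential delay with jitter, one fixed minimum interval between calls); the logger name is hypothetical.

```python
import logging
import random
import time

log = logging.getLogger("collector")  # hypothetical logger name


class RateLimiter:
    """Allow at most one call every `min_interval` seconds."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last = None

    def wait(self):
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


def with_retry(func, attempts=4, base_delay=1.0, max_delay=30.0,
               sleep=time.sleep):
    """Call func(); on failure wait with exponential backoff and retry."""
    for attempt in range(attempts):
        try:
            return func()
        except Exception as exc:
            if attempt == attempts - 1:
                log.error("giving up after %d attempts: %s", attempts, exc)
                raise
            delay = min(max_delay, base_delay * 2 ** attempt)
            delay += random.uniform(0, delay / 2)  # jitter
            log.warning("attempt %d failed (%s); retrying in %.1fs",
                        attempt + 1, exc, delay)
            sleep(delay)
```

In use, each outgoing request would call `limiter.wait()` first and be wrapped in `with_retry(...)`, so that a temporary failure or throttling response does not abort the whole run.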
Configuration (mandatory)
Filters and rules must be changeable without rewriting code:
via config file (YAML/JSON) or
via launch parameters or
via simple admin configuration (if service).
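To illustrate filters and roles living in a config file rather than in code, here is a small JSON-based sketch. Every key and value in the config (`roles`, `include_categories`, `exclude_domains`) is a hypothetical example; the real filter set comes from the initial list provided by the client.

```python
import json
import re

# Hypothetical config; in practice this would be read from a file.
CONFIG_TEXT = """
{
  "roles": ["founder", "ceo", "head of partnerships", "bd", "sales"],
  "filters": {
    "include_categories": ["fintech", "saas"],
    "exclude_domains": ["example.com"]
  }
}
"""


def load_config(text: str) -> dict:
    return json.loads(text)


def passes_filters(entity: dict, filters: dict) -> bool:
    """Apply the configured include/exclude rules to one entity."""
    if entity.get("domain") in filters.get("exclude_domains", []):
        return False
    include = filters.get("include_categories")
    if include and entity.get("category") not in include:
        return False
    return True


def role_matches(job_title: str, roles: list[str]) -> bool:
    """True if any configured role appears in the title as a whole word."""
    return any(
        re.search(r"\b" + re.escape(role) + r"\b", job_title, re.IGNORECASE)
        for role in roles)
```

Changing which entities or roles are collected then means editing the JSON, not the code.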
Technologies (acceptable options)
Python (Requests/Scrapy + Playwright) or Node.js (Playwright/Puppeteer) or Make.com.
Priority: official APIs (if available); otherwise, careful scraping.
Docker (preferably).
Scheduler: cron/Celery/other scheduler.
Run on VPS.
We will provide
Internal list for exclusion (CSV/Google Sheet/JSON).
Initial list of filters and roles.
API keys (if needed).
The contractor must provide
Repository with code or Make.com scenarios (with access/export).
README: launch, configuration, dependencies.
Architecture description (brief) + data flow diagram.
Demonstration run: log, screenshot, or example of export results.
Contractor Requirements
Experience in web scraping / API integrations / lead generation automation.
Understanding of rate limits, anti-bot protection, stability, deduplication.
Examples of similar work (links/descriptions).
Client's review of cooperation with Oleksandr Bilak
Everything is fine; the task was completed as agreed.
Freelancer's review of cooperation with Ruslan Petrov
A detailed technical specification that needed little additional discussion. Payment was quick!
-
It is always interesting to learn about unique tools for collecting and analyzing data. Imagine a solution that not only automates the collection of contacts but does so efficiently through filtering, deduplication, and filter customization without recoding. My experience in web development and API integrations will allow me to create an innovative service by adapting the best practices of parsing and data processing. By implementing this, I will ensure accurate interaction between sources and reliable storage in databases. I am ready to discuss how this system can become your powerful enhancement.
-
Hello. I have experience in parsing, API integrations, and automation; I have implemented similar solutions before.
I propose implementation in Python without Make, in the form of a script with configuration.
Briefly, here is how I see the solution:
- Python (Requests/Scrapy + Playwright for JS sites)
- modular architecture for streams
- incremental collection + deduplication
- collection of contacts by roles
- storage in PostgreSQL or SQLite
- export to CSV, Google Sheets
- logs, retry, rate limit
Filters, roles, and sources - through configuration.
Launch via cron.
I have built similar systems for collecting contacts, and I understand anti-bot measures, stability, and scaling.
I am ready to quickly create an MVP and then expand it for your cases. Please send the sources for review.
-
I have a ready-made solution. Its main functionality:
✅ Global coverage: The platform automatically scans thousands of open sources, websites, and government registries worldwide for instant information retrieval.
✅ Deep data collection: The system independently extracts important details from social networks, media resources, and professional forums, forming a complete picture of the research.
✅ Routine automation: Intelligent agents collect data in real-time, completely freeing employees from hours of manual internet searching.
✅ Instant structuring: All disparate information from various sources is automatically transformed into organized tables (Smart Sheets) for convenient analysis.
✅ Maximum accuracy: Smart scraping technology allows for obtaining verified data even from the most complex segments of the web, ensuring the reliability of results.
More details at this link (Google Drive).
Implementation cost from 2500 USD, timelines from 10 days.
-
Good day. I am ready to complete this project as I have extensive experience in application development.
-
Hello!
I have experience in developing solutions for parsing and data collection, specifically I implemented a service for collecting contacts from LinkedIn and other platforms with subsequent storage in PostgreSQL.
I will implement a script that will regularly retrieve data from specified APIs and websites, perform parsing, and store the results in a structured format. I will use FastAPI for fast request processing and BeautifulSoup or Scrapy for parsing.
Additionally, I will set up deduplication logic to improve the quality of contact collection. Let's discuss the details!