Data parsing
-
14 days ∙ 550 USD
Good day, I have extensive experience in web scraping and have completed data collection and ETL automation projects of varying complexity. I am proficient with the full range of techniques (requests, cookies, proxies, user-agents, authorization, bypassing CAPTCHA including Cloudflare), followed by data processing and transformation.
-
5 days ∙ 250 USD
Hello, I have reviewed your project brief and I am ready to start. In your code, I would replace `requests` with `aiohttp`; `beautifulsoup4` can be replaced with `lxml`, `html5lib`, or `scrapy`. I keep every project under version control on GitHub and have over 4 years of experience with Python. I recently completed a similar project involving request pauses and parsing large volumes of data. I carry out parsing within legal and ethical boundaries.
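As a rough illustration of the asynchronous approach with pauses mentioned above, here is a minimal sketch that uses only the standard library. The `fetch` coroutine is a hypothetical stand-in that simulates a download with `asyncio.sleep`; in a real project its body would be an `aiohttp` request. The URLs, `PAUSE_SECONDS`, and `MAX_CONCURRENT` are assumptions for illustration, not values from the project brief.

```python
import asyncio

PAUSE_SECONDS = 0.01   # assumed polite delay before each request
MAX_CONCURRENT = 3     # assumed cap on simultaneous requests


async def fetch(url: str) -> str:
    """Hypothetical stand-in for a real HTTP call (e.g. via aiohttp)."""
    await asyncio.sleep(0.01)  # simulate network latency
    return f"<html>{url}</html>"


async def fetch_politely(url: str, limiter: asyncio.Semaphore) -> str:
    # The semaphore keeps at most MAX_CONCURRENT fetches in flight.
    async with limiter:
        await asyncio.sleep(PAUSE_SECONDS)  # pause before hitting the site
        return await fetch(url)


async def crawl(urls: list[str]) -> list[str]:
    limiter = asyncio.Semaphore(MAX_CONCURRENT)
    # gather() returns results in the same order as the input URLs.
    return await asyncio.gather(*(fetch_politely(u, limiter) for u in urls))


def run_crawl(urls: list[str]) -> list[str]:
    """Synchronous entry point that runs the crawl in a fresh event loop."""
    return asyncio.run(crawl(urls))
```

Calling `run_crawl` with a list of page URLs returns one result per URL, in input order, while never running more than `MAX_CONCURRENT` simulated requests at once.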
About me: I design a solid architecture for easy deployment using Docker and can package everything needed there. I have a high level of knowledge in `bin`, `bash`, `zsh`, and am a proficient Linux user.
I confidently build both web interfaces and desktop GUIs (PyQt5, kivy, tkinter, Flask, FastAPI, Django). You can view my portfolio at https://github.com/sashabodiul, and I can personally demonstrate real completed projects.
Regarding technology stack, I have a strong background.
In parsing, I use proxies, user-agents, and specify various headers when necessary. I can work with inspectors, JavaScript, DOM, and extract data even from AJAX and GraphQL.
Main parsing libraries: `aiohttp`, `lxml`, `bs4`, `scrapy`
Web automation: Selenium
Creating RESTful APIs in Go and Python: Gin, Flask, FastAPI, Swagger
I have experience working with Google Sheets and Google Cloud Console.
… Ability to work with various SQL and NoSQL databases.
Solving captchas with anticaptcha.
Knowledge of semaphores, `threading`, and `concurrent.futures`. Ability to work with `numpy` and `more_itertools`, specifically `chunked`, which splits the work into uniform batches for parallel scraping and greatly speeds up data collection.
Basic knowledge for building web interfaces.
Additionally, I can work with APIs, extract hidden APIs, and work with XML.
PyQuery for working with HTML documents.
Pyppeteer for automations.
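The batching idea from the skills list above (splitting work with `chunked` and running the batches in parallel) could look roughly like the sketch below. It is an illustration under assumptions, not the bidder's actual code: `chunked` here is a standard-library stand-in for `more_itertools.chunked`, and `scrape_batch` is a hypothetical worker that only tags each URL instead of downloading it.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import islice
from typing import Iterable, Iterator


def chunked(items: Iterable[str], size: int) -> Iterator[list[str]]:
    """Stdlib stand-in for more_itertools.chunked: yield lists of up to `size` items."""
    iterator = iter(items)
    while batch := list(islice(iterator, size)):
        yield batch


def scrape_batch(urls: list[str]) -> list[str]:
    """Hypothetical worker: a real one would download and parse each URL."""
    return [f"parsed:{url}" for url in urls]


def scrape_all(urls: list[str], batch_size: int = 3, workers: int = 4) -> list[str]:
    batches = list(chunked(urls, batch_size))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map() preserves batch order, so results line up with the input.
        results = pool.map(scrape_batch, batches)
        return [item for batch in results for item in batch]
```

Each thread receives one batch, so the last batch may be shorter than the others when the number of URLs is not a multiple of `batch_size`.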
-
7 days ∙ 500 USD
Good day,
Interesting project.
The brief seems to be missing a database, or at least a description of how the data should be handled (where and how to store it while in use, and whether storage is needed at all).
In principle the logic is clear, and I have a rough solution in mind from commercial experience.
It will also be necessary to clarify the legal side regarding the resource: 1 - perhaps they offer the same information in xls or csv format, or simply as a download link.
2 - how legal is it to check a site without permission :)
… My advantages: over 6 years of diverse commercial development experience and a broader view of projects thanks to my specialized higher education.
My stack
Python/Django/FastAPI
PostgreSQL + SqlAlchemy + Alembic
All the best,
Sincerely,
Tali.
-
4 days ∙ 120 USD
I am interested in the offer and confident that I meet your requirements. Looking forward to your feedback, thank you.
-
14 days ∙ 800 USD
Good day,
If you are interested, I can implement it in Node.js.
I have a lot of experience in parsing of varying complexity. Write to me to discuss the details.
-
3 days ∙ 250 USD
Good day,
I have extensive experience in parsing of any complexity.
Write to me.
Current freelance projects in the category Data Parsing
Database of websites on WooCommerce
It is necessary to compile a database of Ukrainian online store websites on WooCommerce with the contact information provided on the sites. Only active websites (indicators: updated catalog/content, working domain). Table format: website address, phone number, e-mail.
Data Parsing ∙ 1 day 19 hours ago ∙ 20 proposals
Create a dashboard in https://airtable.com/ for the performance of advertising creatives from Facebook ads.
Full specification: https://docs.google.com/document/d/1_n_oYRNZWYxalUA---DM5AD1b5ZSrtePw5J4G42svGw/edit?usp=sharing
Databases & SQL, Data Parsing ∙ 3 days 9 hours ago ∙ 17 proposals
Creation of an Excel file for uploading products to the websites of other partners.
I am interested in creating an Excel table with all parameters. Here is the website: https://heiztechnik.com.ua/ The positions I want transferred: Manual boilers: 1) TIS UNI 15-95 kW (10 pcs) 2) TIS HARD 150-500 kW (7 pcs) Pellet boilers: 1) TIS PELLET…
Data Parsing ∙ 3 days 14 hours ago ∙ 35 proposals
A developer is required for parsing the catalog and automating data import.
Detailed technical specifications are in the attached document. Please indicate the estimated cost and timeline in your response. Do you have experience parsing large catalogs? What possible difficulties or limitations do you see in this task?
Databases & SQL, Data Parsing ∙ 3 days 16 hours ago ∙ 40 proposals
Find a product feed (Google Merchant XML) for a website on OpenCart
16 USD
It is necessary to find a direct link to a competitor's active product feed (XML) for Google Merchant Center. Platform (CMS): OpenCart / ocStore. Find the original feed. Requirements for the result: a working link to the XML file.
Python, Data Parsing ∙ 3 days 22 hours ago ∙ 25 proposals