Writing a parser with data collection
2. Go to the “Data Personal” tab.
Here we must enter:
Given name(s) = “Nombre(s)”
First surname = “Primer Apellido”
Second surname = “Segundo apellido”
Let’s look at the example:
Here we have:
Nombre(s): AARON DANIEL ; Primer apellido: RAMIREZ ; Segundo apellido: SANCHEZ;
That is, when the data is entered on the site, the name must be split into 3 components.
3. For the rest of the parameters, we will have to try all the combinations or find a method to estimate the most likely correct input data.
- Sexo* (sex) - there are Python libraries that determine it from the given name alone with a high degree of accuracy.
- Día de nacimiento*; Mes de nacimiento* (day, month of birth) - these parameters have to be found by trying every possible value.
- Año de nacimiento* (year of birth) - can be estimated from year_of_graduation in the input file. People usually finish a bachelor’s degree (4 years of study) when they are around 21-23 years old, and year_of_graduation shows when they finished their studies.
- Estado* (the state in which they were born) - one possible approach is to take the full list of Mexican states sorted by population and start with the most populous ones.
Output:
Let’s take a random politician as an example:
LUIS VIDEGARAY CASO, 10.08.1968, male, Mexico City
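The candidate inputs described above could be generated roughly as follows. This is only a sketch under stated assumptions: the function names are hypothetical, the input name is assumed to be given names followed by exactly two single-word surnames, and the 21-23 age range is taken from the task description.

```python
from datetime import date
from itertools import product


def split_full_name(full_name):
    """Split 'GIVEN NAMES SURNAME1 SURNAME2' into the three form fields.

    Assumption: the last two tokens are the surnames. Compound surnames
    such as 'DE LA CRUZ' would need extra handling.
    """
    tokens = full_name.split()
    if len(tokens) < 3:
        raise ValueError(f"expected at least 3 name parts, got: {full_name!r}")
    return {
        "nombres": " ".join(tokens[:-2]),
        "primer_apellido": tokens[-2],
        "segundo_apellido": tokens[-1],
    }


def birth_year_candidates(year_of_graduation, min_age=21, max_age=23):
    """Years of birth implied by graduating at min_age..max_age, middle age first."""
    years = [year_of_graduation - age for age in range(min_age, max_age + 1)]
    middle = year_of_graduation - (min_age + max_age) // 2
    return sorted(years, key=lambda y: abs(y - middle))


def birth_date_candidates(year_of_graduation):
    """Yield every valid calendar date in the candidate years."""
    for year in birth_year_candidates(year_of_graduation):
        for month, day in product(range(1, 13), range(1, 32)):
            try:
                yield date(year, month, day)
            except ValueError:
                continue  # skip impossible dates such as 31 February
```

For a hypothetical year_of_graduation of 1990 this tries 1968 first, then 1969 and 1967, which gives roughly 1,100 dates per person before the state and sex fields are taken into account. That volume is worth keeping in mind when estimating CAPTCHA costs.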
All collected data must be recorded in separate columns of a CSV file.
If there are several matches, write each one on a separate line.
To solve the CAPTCHA, I propose using one of these services:
https://anti-captcha.com/
https://rucaptcha.com
The program must run under Linux.
Documentation and code comments must be in English.
At the end of the project, the source code of the written program must be handed over.
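The CSV requirement (one column per value, one line per match) could be met with the standard csv module, for example. This is a minimal sketch: the column names and the write_matches helper are hypothetical, since the task does not fix a header.

```python
import csv

# Hypothetical column names; the task only says each value gets its own column.
FIELDNAMES = [
    "nombres",
    "primer_apellido",
    "segundo_apellido",
    "fecha_nacimiento",
    "sexo",
    "estado",
]


def write_matches(path, matches):
    """Write one CSV row per match and return the number of rows written.

    Several matches for the same person simply become several rows.
    """
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=FIELDNAMES)
        writer.writeheader()
        writer.writerows(matches)
    return len(matches)
```

Each match is passed as a dict keyed by the column names, so the example above would become a row with LUIS, VIDEGARAY, CASO, 10.08.1968 and the sex and state values in their own columns.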