Writing a parser on Python to collect emails
You need to write a parser on Python, which will go through the list of published sites and collect all emails.
All websites are European companies (the structure of websites is different)
1 .Parser should search for the data in the footer/header, also, enter the “Contact”/”Our” section and search there, as often in the footer/header may not be data or there only the company’s mail, not the CEO and other employees.You only need to collect emails, other contact details to be missed, but all emails from all pages must be collected.
ThreeThe location of the contact details may be both on the main page and on the individual intended page.4 .The location of the contact data can be both in the site’s hat, in the footer, and anywhere else on the page.The parser must work in a multi-way mode
Collected emails:
1. to be structured (e-mail should be opposite to the corresponding URL of the site and should not be dispersed in different columns in the table, see.Screenshot “needed look”)
Not having any excessive information (phones, names and surname, post, part of the page code)
Remove the duplicates
The outcome of results
The results should be in the form of a CSV file.
by ITOG
The final product is a working parser with the source code and with the documentation, in which you can independently replace the links and that it performs the above tasks.Additionally
The task is attached to screenshots of how the collected emails should look and be structured.And also an example of how e-mails should not look.
Applications 2
-
Good day ! Very interested in your request, ready to get to work.
-
3840 78 0 I have done this project before (you can see in my early reviews). I can do it in PHP, but I can do it in Python. Can you download the list of sites?
-
322 3 0 Hi, I’m interested in your project, I have a great experience in data collection, I’ll write a script on Python, I’ll do it quickly and quality. Ready to start right now
-
434 9 0 Good day . Please send a few links to the list in the private message.
-
194 Good day !
Full-stack developer with more than 6 years of work experience, your project is very interesting, I have the necessary experience to implement it, I offer such technologies Node.js + Vue.js. Let us make a call for a detailed discussion of the task, share our vision and discuss cooperation, write
-
2762 58 0 Good day .
I am working on the development of parsers.
I am waiting for you in personal messages to discuss the project.
Current freelance projects in the category Data Parsing
BOT, TG
226 USD
Hello everyone, friends. I will try to describe what is needed in more detail, but if questions arise, we will discuss them more deeply and clarify in private messages. We are focused on sales in the construction sector: in general, this includes several sales channels and… Data Parsing, Bot Development ∙ 2 hours 49 minutes back ∙ 25 proposals |
Need a parser for the online store https://www.lcsc.com/It is necessary to regularly (once a month, or upon script launch) obtain up-to-date information about the products available in the store. https://www.lcsc.com/ from the catalog of all sections.… Data Parsing ∙ 1 day 3 hours back ∙ 41 proposals |
OpenCart — rental catalog of special equipment
135 USD
OpenCart — Equipment Rental Catalog Need to launch an equipment rental catalog on OpenCart. Theme: excavators cherry pickers forklifts generators cranes scaffolding other construction equipment. It is preferable that you already have a ready-made template or developments… Web Programming, Data Parsing ∙ 1 day 19 hours back ∙ 55 proposals |
Transfer the program - the server where the program was located has crashed (officially permitted parsing of government data)
46 USD
Hello! My client has encountered the case described below. We need help transferring to a new server and testing the program. It would be better to have a programmer who understands parsing. Software & Server Configuration, Data Parsing ∙ 1 day 23 hours back ∙ 29 proposals |
Website parsingImplementation of 4 parsers (directory websites) is required. There is a technical specification, and there is a code example as a reference. The tasks include: Writing a parser Integrating a proxy Deduplication logic (transfer the logic from the example) Hashing logic based… Data Parsing ∙ 3 days 15 hours back ∙ 44 proposals |