LinkedIn Data Scraper
Project Objective: The research objective pertains to comprehending Internet Presence, aiming to develop an Identity Scraper that can analyze and compile information available about an individual from publicly accessible sources. This investigative endeavor will be structured into multiple phases, each intended to yield concrete outcomes that can be presented to the board to demonstrate advancement and practicality. Input parameters may include a person's name, photograph, or phone number, while the anticipated output will consist of one or more profiles accompanied by a probability score indicating their authenticity.
Project Phases and Deliverables
Phase 1: Pilot Project - LinkedIn Data Scraping
Objective: Conduct a pilot project focusing on scraping LinkedIn data to identify individuals based on provided names. This phase also includes a scalable database to store the data collected. In Phase 1 the database will store the outputs of the search and also include basic information such as identifier, data source, etc.
Input Requirement: Basic name (e.g., John Doe).
Expected Output: Profiles corresponding to individuals sharing the provided name, including:
- First Name
- Surname
- Location
- Occupation
- Profile Picture
Deliverables for Phase 1:
- Data Collection Report:
- Detailed report on the data scraping process, including methods and tools used.
- Explanation of data sources and the legality of scraping LinkedIn data.
- Profile Compilation:
- A database of structured profiles matching the provided names
- Each profile to include the specified details as per expected output
- Probability Score Algorithm:
- Development of an algorithm to assign a probability score indicating the authenticity of the profiles.
- Documentation explaining the criteria and logic behind the probability scoring system.
- Presentation to the Board:
- A comprehensive presentation summarizing the findings and demonstrating the practicality of the Identity Scraper.
- Visuals and charts showcasing the effectiveness and accuracy of the tool in identifying profiles.
If Phase 1 is a success and approved, additional phases will be commissioned to the developers based on the success shown in Phase 1.
Additional Phases:
Phase 2: APIs, Search Engine scraping, expansion to other social media platforms (e.g., Facebook, Twitter, Instagram), and public databases with incorporation of additional input parameters (e.g., current employer, phone number, etc).
Phase 3: Implementation of a GUI and incorporation of Search Tables, Search Optimization, Advanced Analytics and Reporting features
Phase 4: Enhancements to the probability scoring algorithm based on feedback and results.
Requirements for Freelancers:
Proven experience in web scraping and data analysis.
Proficiency in programming languages such as Python, Java, or relevant alternatives.
Proficiency in RPA and automation tools such as UiPath, BluePrism, Pega, AA or relevant alternatives.
Familiarity with Social Platforms and APIs.
Ability to deliver detailed reports and presentations.
How to Apply:
– Provide a brief introduction about yourself and your experience in similar projects.
– Include examples of past projects relevant to web scraping and data analysis.
– Outline your proposed approach for Phase 1 of the project.
– Reply with your availability and expected timeline & budget.
-
30 days1000 USD
956 14 0 30 days1000 USDExperience in scraping and creating analytical systems is significant. Projects are present in the portfolio and order history. Data scraping is not difficult. Probability assessment of authenticity should be conducted using artificial intelligence. The recommended model by me as of today is ChatGPT4o. The approximate cost of one AI-assisted assessment will be around 1-3 cents. Graphic representation, presentations for the board of directors should be done by another specialist. Data scraping, database work, analytics are in a completely different professional field than creating presentations and representations. It is also not desirable to implement the entire project by the efforts of one person regardless of their level of professional training, due to the presence of tasks in the project related to completely different fields of knowledge. The first phase of the project should also be divided into several parts. The price and deadline are conditional for now, as the project needs further development.
Current freelance projects in the category Data Parsing
Need a parser for the online store https://www.lcsc.com/It is necessary to regularly (once a month, or upon script launch) obtain up-to-date information about the products available in the store. https://www.lcsc.com/ from the catalog of all sections.… Data Parsing ∙ 7 hours 29 minutes back ∙ 28 proposals |
OpenCart — rental catalog of special equipment
135 USD
OpenCart — Equipment Rental Catalog Need to launch an equipment rental catalog on OpenCart. Theme: excavators cherry pickers forklifts generators cranes scaffolding other construction equipment. It is preferable that you already have a ready-made template or developments… Web Programming, Data Parsing ∙ 23 hours 55 minutes back ∙ 46 proposals |
Transfer the program - the server where the program was located has crashed (officially permitted parsing of government data)
47 USD
Hello! My client has encountered the case described below. We need help transferring to a new server and testing the program. It would be better to have a programmer who understands parsing. Software & Server Configuration, Data Parsing ∙ 1 day 3 hours back ∙ 27 proposals |
Website parsingImplementation of 4 parsers (directory websites) is required. There is a technical specification, and there is a code example as a reference. The tasks include: Writing a parser Integrating a proxy Deduplication logic (transfer the logic from the example) Hashing logic based… Data Parsing ∙ 2 days 20 hours back ∙ 42 proposals |
Collection (parsing) of product database from supplier websites (Excel / CSV)
226 USD
Collection of product database from supplier websites (Excel / CSV) Good day. A specialist is required to collect and structure data from several supplier websites, access to which will be provided.Task: A unified product database needs to be created in Excel (XLSX) or CSV… Web Programming, Data Parsing ∙ 4 days 3 hours back ∙ 103 proposals |