Write a review parser for g2.com
A parser for reviews needs to be developed in Python to collect data from product pages. Example target page: https://www.g2.com/products/pandadoc/reviews
Main requirements:
- Programming language: Python 3.x
- Output reviews in JSON format
- Type of application: console
Example json:
{
"product_name": "PandaDoc",
"number_of_reviews": "2,548 reviews",
"reviews": [
{
"review": "What do you like best about PandaDoc?PandaDoc is very affordable for small businesses. It has enough features to help streamline the \"get a contract signed\" process. I love the feature of being able to assign information and signature fields to different users...",
"rating": "4.5"
}
]
}Development includes subtasks such as:
- Handling pagination to collect all reviews
- Implementing a proxy/IP rotation mechanism to bypass possible restrictions
Additional information:
- Reviews are located in elements with the class .paper.paper--white.paper--box, the review text is in a div with the attribute itemprop="reviewBody"
- It is recommended to use standard Python libraries, such as BeautifulSoup or Scrapy, however, the final choice is yours.
Expected result:
A console application in Python that allows entering links to various products and collecting all available reviews, saving them in a .json file. + Instructions on which service to use for ip/proxies.
-
204 Good day, I have extensive experience in writing parsers using the BeautifulSoup4 library, I would be happy to complete your project!
-
1831 46 3 Hello!
I am interested in your project, I have extensive experience in asynchronous/multithreaded parsing (Requests, Websockets, AioHTTP, HTTPX, BS4) and emulating user actions (Selenium/Playwright).
Contact me to discuss the details and deadlines for this project!
-
194 Hello! I am an experienced Python developer and I will create a console application for parsing reviews with pagination handling and proxy rotation, outputting data in JSON format. I guarantee quality code and instructions for setting up the proxy. I will complete it in 5 days for 2500 UAH. I would be happy to discuss the details!
-
15546 214 1 Good afternoon!
I have extensive experience in data parsing.
I will develop a console program for you.
I would be happy to discuss collaboration.
-
1251 35 1 3 Hello Yaroslav, I am ready to complete the project.
I will write a convenient console application in .py
In general, the task is clear, it is necessary to study what limitations may be from Cloudflare, etc.
-
1982 47 7 2 Hello!
My name is Pavel, I am a Python developer with experience in creating parsers. I am ready to develop a console application for parsing reviews from product pages (for example, like on G2). I use Python and libraries such as BeautifulSoup or Scrapy, and I also implement proxy/IP rotation to bypass restrictions.
I would be happy to discuss the project details and propose a solution.
Best regards,
Pavel
-
95478 1271 1 10 Hello. I have extensive experience with Python/Node.js. I am ready to implement a parser.
-
397 14 0 Hello, my name is Bogdan and I would like to work on your project.
I have already written a parser for this site and I am ready to refine the logic with proxies and saving in json.
-
434 9 0 Good evening. I can do it in Python using requests/bs4/scrapy or selenium/playwright.
-
1285 30 0 I have completed similar tasks multiple times.
Feel free to contact me, I will write a parser according to your requirements. Write to me privately, we will discuss the technical specifications and agree on the details of the work.
-
1330 21 0 Hello, I will gladly help with your project.
I looked at the example page and your json file example. As I understand, you want to input only the address of the first review page with the product name and for the parser in json to collect all reviews from all subsequent pages until the end.
To discuss the details, deadlines, and price, let's move to private messages. For now, I will set the starting price for the parser.
-
3392 84 0 Ready to take it on.
Write to me, we will discuss the order details.
I will implement it in Python.
-
Вам нужно только тело комментария или еще и тайл и мета информация? Остальные пункты понятны, на сайте есть капча, но с прокси она не часто появляется
-
Current freelance projects in the category Data Parsing
Consultation on parsing Instagram account subscribersHello. It is necessary to conduct a preliminary assessment of the feasibility of the following task. I have a list of Instagram accounts. The goal is to obtain contact information (primarily email addresses) of users who follow these accounts. Previously, I encountered companies… Data Parsing ∙ 17 hours 4 minutes back ∙ 10 proposals |
A specialist is needed to find contacts of decision-makers in Ukraine.It is necessary to gather a database (or ready database) of contacts of decision-makers (DMs) in companies in Ukraine. Information Gathering, Data Parsing ∙ 21 hours 36 minutes back ∙ 13 proposals |
Need to scrape data from LinkedInWe need to scrape data from LinkedIn based on our list. For each entry, we need to find and collect available data if it exists on the LinkedIn profile, including the profile picture on the LinkedIn social network, email address, links to social media, company website, and… Data Parsing ∙ 1 day 3 hours back ∙ 23 proposals |
Parsing and classification of dataWe are looking for a developer to implement a system for collecting and structuring data from open sources. We have a database of small business owners in the USA, which contains the person's name, company name, address, and state. It is necessary to build a process for… Web Programming, Data Parsing ∙ 1 day 4 hours back ∙ 36 proposals |
Svitlahata
17 USD
It is necessary to import 1819 products from the XML/YML feed of Prom.ua to OpenCart 3. A ready XML file is available, which contains product names, descriptions, prices, photos, specifications, manufacturers, and categories. Requirements: import all products to OpenCart… Content Management Systems, Data Parsing ∙ 2 days 7 hours back ∙ 33 proposals |