Switch to English?
Yes
Переключитись на українську?
Так
Переключиться на русскую?
Да
Przełączyć się na polską?
Tak
Post your project for free and start receiving proposals from freelancers within minutes after publication!

It is necessary to parse the PDF file.

Translated

Applications 2

Application viewing is only available registered users.
  1. 10755
     149  0

    7 days111 USD

    Good day. It is a challenging task, but solely due to the need for category mapping without specific instructions about it. If there is a detailed instruction or an alternative method, the price will be lower.

  2. 2223
     98  0

    1 day45 USD

    Good day
    I parsed a piece of https://docs.google.com/spreadsheets/d/1VrUl-Fax9QFm9W7DfT-VU6OYzqEn1UxUAWqwAzPA1dA/edit?gid=0#gid=0
    Is that right?
    Please reply

  3. 3529
     31  0

    1 day45 USD

    Hello! I have been developing parsers for over 3 years. I will be able to parse your files according to the instructions.

    I am ready to start working today and provide the finished result tomorrow.

  4. 315  
    10 days56 USD

    Ready to start working at any time, we can discuss the deadlines and cost, write back and I will send the contact details for communication.

  5. 138  
    2 days16 USD

    Hello! I am ready to implement parsing of your PDF file in C#. I have experience working with regular expressions (Regex) for precise data extraction. I will provide a structured result (JSON/Excel/Text). I work through Safe.

  6. 294    1  0
    2 days56 USD

    Evgeny, good afternoon!
    I have studied your PDF price list, the example table, and the structure of the website balluff-ua.com.

    What exactly I will do:
    1. Synchronization with the website: I will analyze the category tree on balluff-ua.com/product and set up the script so that the data from the PDF is automatically matched with the necessary sections of the website (Sensors -> Inductive Sensors, etc.). In columns A and B, you will have the perfect order for import.
    2. Solving the problem of "crooked" columns: I will use the coordinate method (pdfplumber library) to extract data. This will allow for the correct separation of "Part number," "Ordering code," and "Price," even if the text in the PDF is misaligned or the columns have different widths.
    3. SQL compatibility: You will receive a file (Excel/CSV) in which the data is already structured for your SQL database, which will eliminate errors when uploading to the website.

    My advantage: I do not just copy text, but create an algorithm that "understands" the hierarchy of products.

    I would be happy to discuss the details!

  7. 172    1  1
    1 day111 USD

    Good day. I am ready to complete this project and have extensive experience in developing various applications.

  8. 372    4  0
    3 days89 USD

    Hello. I have reviewed the task and am ready to assist with parsing the Balluff catalog. I have experience working with complex PDF layouts where columns are arranged asymmetrically, so I will use Python scripts to accurately configure the processing of each page and maintain the hierarchy of categories. I will take into account the structure of the sensor section from the website so that inductive, capacitive, and other types of devices are placed in the appropriate Excel columns according to your example. Since you mentioned the presence of an SQL database with the correct layout, I will also be able to help with synchronizing this data for the website in the future, as I understand the logic of relationships in databases.

  9. 758    6  0
    4 days89 USD

    Good day! I have extensive experience in parsing documents. Could you please clarify, is it necessary to parse one PDF file? What is the volume?

    On average, I can complete it in 2-3 days.

  10. 1344    6  0
    7 days557 USD

    Hello!
    I understand that it is necessary to parse a PDF file into an Excel table according to the example, and then upload the data to the website.
    I will do this using n8n, Google Sheets, and Apps Script.
    Please provide the details — I will clarify the terms and cost.

  11. 1315    7  0
    3 days134 USD

    Good day.
    I am ready to take on the work.
    I will be able to create automation that will parse and categorize your files.
    Write to me privately, we will discuss possible nuances and can get started.

  12. 1239    16  0
    1 day78 USD

    Hello!
    I understand your task, I suspect that you either have more than one PDF or will have more than one. Therefore, I suggest writing a parser right away that will take a PDF file as input and output a valid Excel file. You will only need to run the script and specify the path to your file.
    Language: Python.
    If necessary, the script will be compiled into an application for Windows (exe file).

  13. 702    1  0
    3 days89 USD

    Hello! I have extensive experience in parsing. Therefore, I offer a loyal price and quality work. Write to me)

  14. 826    3  0
    3 days45 USD

    Hello! I am a Python developer specializing in data parsing and automation (PDF, Web, SQL).

    What I will do:
    1. I will deal with crooked columns: I will write a script that will correctly process the two-column layout of the PDF so that the data does not get mixed up.
    2. I will categorize: I will set up automatic linking of products to the necessary sections (Inductive, Capacitive, etc.), based on the structure of the Balluff website.
    3. I will prepare Excel: You will receive a table exactly according to your example, ready for import.

    I am also ready to work on the second task with the SQL database. Message me privately, and we will discuss the deadlines and budget!

  15. 3920    101  0
    1 day18 USD

    Hello. I am ready to help with this task. The PDF contains all the necessary information for the work. 1 day, 800 UAH.

  16. 5860    345  0
    1 day22 USD

    I can transfer data from PDF to Excel without "looking at the site."

  17. 3367    148  4   1
    1 day45 USD

    Good day.
    I will create a solution in node js that will correctly process the file.
    The price and deadline are negotiable.
    If you have any questions, you can write to me in private messages.

  18. 4182    198  2   5
    2 days111 USD

    I have experience parsing PDF into tabular formats while preserving the structure of categories, including with asymmetric columns. For this, I have used various PHP libraries and Python tools integrated into workflows.

    In your case, I will take the categories "Sensors" and their subcategories as a basis, according to the assignment and the website balluff-ua.com. The final Excel table will be structured in a way that makes it easy to upload data to the website and correctly display categories.

    I use PHP with the Laravel framework, which will speed up work with the database (MySQL/PostgreSQL) and facilitate further integration. I have experience with REST API and extracting data from complex structures — this will help in the future automatic data updates.

    I suggest first creating a demo version of the parser for the selected categories to ensure the correctness of the data and then proceed to further development and integration with the database in the next stage. I can organize the process through Docker for ease of launching and debugging.

    I am ready to discuss the details and start according to your schedule.

  19. 693    15  0   2
    2 days36 USD

    Good day! I will convert your PDF file to Excel quickly and efficiently! I have already done similar projects! I will be happy to help you!)

  20. 240  
    4 days67 USD

    Hello, Evgeny!

    Your case: parsing the PDF catalog of Balluff into Excel according to the template from Google Sheets (you have it open) + matching categories/subcategories on balluff-ua.com/product. Start with the category Sensors (SENSORS) and its subcategories.

    Plan:

    1. I will take the PDF + template table + category structure from the website balluff-ua.com. If there is a link to the PDF, I will take it immediately; if the file is attached in FH, please send it in the comments to the project.
    2. I will write a parser in Python (pdfplumber / tabula for tables + custom logic for asymmetric columns, as you described). For positions where the structure breaks, I will provide a fallback with manual processing.
    3. Normalization: I will match the articles with the subcategories of balluff-ua — I will do this by exact code matching + fallback by category name. Unmatched positions will be marked with a separate status "needs_review."
    4. Export to Excel strictly according to your Google Sheets template: the same columns, the same data types, the same format.
    5. I will provide you with XLSX + a short ReadMe: how many positions were parsed cleanly, how many ended up in needs_review and why. So you can immediately see the quality.

    Cost: 3,000 UAH for parsing Sensors and its subcategories into the table according to the template. Deadline is 3–4 working days from receiving the file.

    I am ready to evaluate the second task (matching with your SQL database of correct composition) separately when you send the schema dump of the database.

    Please send in the comments to the project: the PDF itself (or a link), confirmation that the Google Sheets template is current, and if available — an example of a "correctly matched" position for calibration.

  21. Another 7 proposals concealed
  • Vladimir B
    17 April, 19:04 |

    смотрите "можно смотреть на сайте" это значит что нужно реализовать парсинг сайта, чтобы просто посмотреть. Поэтому хотелось бы чтобы вы уточнили тз, а именно 

    1) распарсить пдф, и вы точно указываете пример данных из пдф и что и куда заливать в гугл таблицу

    2) имея данные из пдф в нужной форме уже ставить задачу поиска этих данных на сайте и заполнить категории, либо что-то еще.


    Желательно разбить это двумя подзадачами, чтобы за каждую определять свой бюджет.  Ну если конечно Вы хотите по нормальному.

Current freelance projects in the category Data Parsing

Collection of a database of designers, architects, and installation companies in Ukraine

Task description: It is necessary to collect an up-to-date database of contacts in Ukraine for further B2B communication. Required categories: Interior designers Architects / architectural firms Installation companies Companies engaged in repairs, finishing, lighting,…

Data Parsing ∙ 9 hours 13 minutes back ∙ 26 proposals

Telegram group parser

22 USD

# Technical Assignment ## Project Goal It is necessary to develop a parser for Telegram groups that will find groups based on specified keywords and save the results in text files. ## Main Functionality ### 1. Group Search The parser should search for Telegram groups…

Data ParsingBot Development ∙ 11 hours 47 minutes back ∙ 43 proposals

Parsing products, preparation for import to WP

Scrape the full catalog of these websites: https://svit-mebliv.ua/ https://kompanit.com.ua/ru https://amia.com.ua/ https://mebliromax.com.ua/ https://pehotin.com.ua/catalog/ https://www.sokme.ua/ru/ All products need to be combined into one general table for import into WP.…

Web ProgrammingData Parsing ∙ 1 day 3 hours back ∙ 53 proposals

I am looking for a programmer for OpenCart.

Good day 1) It is necessary to implement on the website dneprkomfort.dp.ua A module for Ukrainian banks has been purchased, and we have already integrated Mono Bank Here is an example from our competitor It is necessary to implement installment payments, purchase in parts…

Web ProgrammingData Parsing ∙ 2 days 1 hour back ∙ 48 proposals

A specialist in Telegram promotion is required.

28 USD

Tasks: invite real users from the username database to new chats and send messages to the target database. Only quality traffic and work with a live audience are of interest — performers using bots, fake engagement, or low-quality methods are requested NOT TO DISTURB. Work…

Data ParsingSocial Media Marketing (SMM) ∙ 6 days 3 hours back ∙ 9 proposals

Client
Yevgeny Ded
Ukraine Odessa  85  0
Project published
2 months 9 days back
443 views
Tags
  • Excel
  • PDF
  • SQL