Parsing text PDF with tables
It is necessary to parse a text PDF with tables and create a dynamic object with all the data that is in the document.
It contains 2 main tables that can be "glued" into one and then for each record from this table - there is a detailed information table a little lower after the main tables.
Ideally, I would like to be able to work with this data later through Python.
Thank you in advance.
Applications 1
-
1 day108 USD1 day108 USD
Good day. I have done something similar before, but I would like to discuss the final storage format in more detail. I would be happy to collaborate.
-
5 days108 USD
332 5 days108 USDHello!
I can implement your project in Python. The work plan is as follows:
Parsing PDF with tables using libraries like pdfplumber, camelot, or tabula-py.
Merging the main tables into a single dynamic structure (for example, a list of dictionaries or DataFrame), so that each record is unique and ready for processing.
For each record of the main table, detailed information from the lower table will be linked.
…
Creating a dynamic object/structure that can be conveniently worked with in Python (for example, through pandas or directly as objects/dictionary).
Optionally: the ability to save data in CSV/JSON for further analytics or processing.
The implementation will be flexible, so you can easily filter, analyze, and modify the data after parsing.
I am ready to discuss the details of the PDF and the implementation timeline.
-
3 days135 USD
1002 5 1 3 days135 USDGood afternoon, I can implement this and add AI for normalization, message me privately.
-
1 day102 USD
267 1 day102 USDHello,
I’ve completed your task. From the text PDF with tables, I built a dynamic Python object and a single merged summary table: the two main tables are glued by ecu_code, and for each record the corresponding … DETAILS section is attached.
Deliverables:
A clean CLI script (menu/args) with structured logs and a README.
Results in two formats:
… ecu_summary_merged.csv — merged summary (main tables + “(CONT…)” fields).
ecu_merged.json — dynamic object: summary_merged[] plus details{} per ecu_code.
Verified on your document: parsing is consistent, multiline fields (e.g., CVNS) are handled.
Run locally (if needed):
python parse_ecu_pdf.py --input "Details Report.pdf" --outdir "out" --log "out/run.log"
# or interactive:
python parse_ecu_pdf.py --menu
Attached:
parse_ecu_pdf.py
ecu_summary_merged.csv
ecu_merged.json
(optional) short screencast/screenshots
I can also provide XLSX output and/or a minimal web UI for browsing and search.
Files are ready to hand over.
-
4 days108 USD
297 2 0 4 days108 USDHello. I am ready to implement a parser for your PDF document.
I will do:
– Reading and processing the main PDF with tables
– Merging two main tables into one
– Linking detailed information to each record
– Output as a Python object or pandas DataFrame, which can be easily worked with
The work will be clean, and the code will be understandable. Message me privately — I will show everything and we will clarify the details.
-
1 day108 USD
2225 32 0 1 day108 USDGood day. I have already made this parser. Everything is ready.
Good day. I have already made this parser. Everything is ready.
+++++++++++++++++++++++++++++++++++++++++++++
-
3 days108 USD
2248 63 2 2 3 days108 USDHello!
I have experience working with different types of data
I can help with both text data and images.
-
1 day108 USD
3313 70 1 1 day108 USDHello.
I can create an object with information in pdf, I suggest making a json file. I can also create any other format if necessary.
Write to me to discuss which format will be better to work with the data going forward.
-
3 days108 USD
171 3 days108 USDGood day!
I can implement a solution in Python for parsing PDF:
- extract the main tables,
- combine them into one,
- add detailed information from the following tables,
- save the result in a convenient format (for example, DataFrame or JSON) for further work.
I would be happy to clarify the details of the task and agree on the format of the final result.
-
1 day108 USD
2426 20 0 1 day108 USDGood day, I am ready to complete the task quickly and efficiently, please write to me in private messages to discuss the details. I will be happy to help)
-
1 day108 USD
2223 18 3 1 day108 USDGood day, I have already developed similar parsers, I will do it using Python + pdfplumber + pandas DataFrame. If you are interested - write to me, I will be happy to discuss in more detail.
-
2 days108 USD
1328 35 1 2 days108 USDGood day. I have already done a similar project. But in PHP. If it is not essential that it be PHP, feel free to contact me, I will do it.
-
4 days116 USD
2788 42 1 4 days116 USDMy greetings, Artem
Do you need a utility that you will call from your Python code like
parser -pdf path/to/my.pdf
and receive data in a structured format (some specialized class)?
Maybe I will be of help..
-
1 day108 USD
1495 13 0 1 day108 USDHello! I can implement it. Please message me privately to discuss all the details. I would be happy to collaborate!
-
3 days108 USD
9972 117 0 3 days108 USDHello.
I am a NodeJS developer. I am ready to take it on. Write to me, we will discuss.
-
1 day108 USD
2991 73 4 2 1 day108 USDGood day! I am implementing such a parser in Python!!!!!!!!!!!
Feel free to contact me!!!!!!
Current freelance projects in the category Data Parsing
A specialist in Telegram promotion is required.
29 USD
Tasks: invite real users from the username database to new chats and send messages to the target database. Only quality traffic and work with a live audience are of interest — performers using bots, fake engagement, or low-quality methods are requested NOT TO DISTURB. Work… Data Parsing, Social Media Marketing (SMM) ∙ 1 day 1 hour back ∙ 6 proposals |
Collection of B2B database of companies in Germany
40 USD
Goal: To obtain a list of potential employers (clients) for B2B mailing. Region: Munich (München) + radius of 50 km. Required niches: Construction companies (Bauunternehmen) Food enterprises (Lebensmittelhersteller, meat processing plants, bakeries) Logistics and… Data Parsing, Lead Generation & Sales ∙ 1 day 3 hours back ∙ 26 proposals |
Carrier databaseInterested in compiling a database of carriers in Ukraine for the year 2026, including tankers, tarpaulins, grain carriers, and others. It is preferable to develop a table. Information Gathering, Data Parsing ∙ 1 day 4 hours back ∙ 29 proposals |
Consultation on parsing Instagram account subscribersHello. It is necessary to conduct a preliminary assessment of the feasibility of the following task. I have a list of Instagram accounts. The goal is to obtain contact information (primarily email addresses) of users who follow these accounts. Previously, I encountered companies… Data Parsing ∙ 4 days 20 hours back ∙ 12 proposals |
A specialist is needed to find contacts of decision-makers in Ukraine.It is necessary to gather a database (or ready database) of contacts of decision-makers (DMs) in companies in Ukraine. Information Gathering, Data Parsing ∙ 5 days back ∙ 18 proposals |