Switch to English?
Yes
Переключитись на українську?
Так
Переключиться на русскую?
Да
Przełączyć się na polską?
Tak
Post your project for free and start receiving proposals from freelancers within minutes after publication!

Mapping PDF Data to Excel Columns with Coordinates


Applications 2

Application viewing is only available registered users.
  1. 445    2  0
    1 day150 USD

    Hello, Adi Yancher
    Hope you are well.

    Nice challenge for a modern programmer..

    The simplest way is to use python libraries for your case.
    Investigated this issue in python. Nice. Has some drawbacks. As well as distribution size.

    Investigated apache pdfbox for java. More consize results.
    There is no need for OCR-ing. But not investigated apache POI yet.

    Anyway, there should be a graphical user interface, parsing/mapping rules, and text to excel mapping as well as. common pdf document templates and so on.
    May be thinking ahead for web-service for another users.

    Solution:
    OS platforms - where java is running on.
    java, apache pdfbox 3 apache POI
    Optionally: tesseract-ocr.
    Optionally: tesseract-ocr. model extra training.

    Will be glad to hear your mind.

    With regards.

  2. 316  
    7 days200 USD

    Hello! 👋

    I am excited to assist with your project of creating a powerful and efficient script/tool for extracting Hebrew-language data from PDF files and populating it into Excel. Here's why I'm the perfect fit for this task:

    Why Choose Me?
    Expertise in OCR and Automation:

    I have extensive experience with Tesseract OCR, including working with Hebrew-language support, ensuring high accuracy in text extraction.
    Proven track record in creating automated tools for complex data extraction and mapping.
    Flawless Data Mapping:

    I specialize in designing scripts that accurately identify keywords in PDFs and map them to the correct Excel columns, following predefined structures.
    I can implement error handling for missing or incorrect data, ensuring clean and reliable output.
    Attention to Detail:

    I understand the importance of handling multiple loan plans and parsing complex fields like dates, interest rates, and monthly payments (including symbols like ₪).
    I'll make sure your Excel output is professionally formatted and meets your requirements.
    Efficient Workflow and Communication:

    I work quickly without compromising quality. The task will be delivered on time with updates at every stage.
    I value clear communication and will ensure the tool/script is easy to use and customizable for future needs.
    My Plan to Execute Your Task
    OCR Setup:

    Configure Tesseract with Hebrew language support to extract text efficiently from PDF files.
    Data Extraction and Mapping:

    Develop a robust script to identify specific fields like Loan Type, Amount, Interest Rate, and map them to their respective Excel columns.
    Error Handling and Formatting:

    Build error-checking mechanisms to handle missing data gracefully.
    Format the output Excel file with precision, ensuring it aligns with your specifications.
    Delivery and Support:

    Provide a fully functional and tested script or tool.
    Offer post-delivery support to ensure seamless integration and use.
    Let’s Get Started!
    I’m confident that I can deliver a high-quality solution tailored to your needs. Let’s discuss your requirements further, and I’ll make sure this project exceeds your expectations. I look forward to collaborating with you! 😊

  3. 5149    210  0
    7 days200 USD

    Hello,
    I can implement a solution for your project as a .exe program for Windows.
    However, I have a few questions to discuss:
    - Do all PDF files follow the same template as the attached file?
    - To better understand the information connections, could you record a video showing how you manually fill in an Excel file based on a PDF file?

  4. 1 proposal concealed

Current freelance projects in the category Databases & SQL

It is necessary to check the scripts and update the data in the Postgres database.

It is necessary to correct the SQL scripts for the Postgres database. It is required to check the scripts and update data from external Excel tables and between two Postgres databases (different servers). Scripts will be run through AnyDesk using Navicat. List of data for…

Databases & SQL ∙ 1 day 5 hours back ∙ 18 proposals

Need an Airtable architect to build a relational schema and a new clean Airtable base.

Need help rethinking and building a clean relational schema for an internal operational system on Airtable. The current database is already in use by the team, but it has grown organically: the structure is partially flat, some tables/views are actively used, while others are…

Databases & SQLDesktop Apps ∙ 1 day 22 hours back ∙ 9 proposals

Basketball Coaching Education Platform + Custom CMS

Basketball Coaching Education Platform + Custom CMSProject Overview We are looking for an experienced web development team or full-stack developer to build a modern basketball coaching education platform. The website will provide basketball coaches with access to educational…

Databases & SQLWeb Programming ∙ 2 days 16 hours back ∙ 88 proposals

Integration of Viber in 8.3

223 USD

Need Viber integration into own CRM (1C 8.3)About the Company The company "Domofon System" is engaged in the installation and maintenance of intercom systems. Base of over 40,000 subscribers. We work on our own customized system based on 1C 8.3. We are looking for a specialist…

Databases & SQLBot Development ∙ 2 days 19 hours back ∙ 16 proposals

Refinement of 1C UT 11 for Zebra TSD (RDP): different sound signals when scanning

22 USD

Configuration: 1C UT 11 Address warehouse Zebra TC26 TSD Work via RDP Product scanning is performed in receiving, placement, picking documents, and other warehouse operations. Current problem: Warehouse workers operate through the Zebra TSD. When scanning, they do not always…

C#Databases & SQL ∙ 4 days 17 hours back ∙ 6 proposals

Client
Adi Yancher
Israel Бээр-Шева  21  1
Project published
1 year back
239 views
Tags
  • OCR
  • tesseract
  • Excel