Unpacking and assembling documents.
Hello!
It is necessary to write code (Python) that will take a document (pdf or docs), extract text from it, send it to another service for translation, and then reassemble the same document but with the translated text.
An example of the original document is attached.
It is very important that the structure of the document is not violated. This is the main reason I am looking for a specialist because I can extract text myself, even from photos)
In simple words, I need a tool that will parse and reassemble the file but with different text.
The price is negotiable.
ps: Colleagues, please only write if you have done this or are confident that you can do it, thank you in advance.
Applications 1
-
595 2 0 Hello Sergey,
In order not to break the structure of the document, we need to remember how many characters were on each line while reading the document, as well as record the parameters for indentation and paragraphs. As a result, in addition to the text, we will have the indentation parameters and the number of characters in each line.
Then we will get the text translated into another language from the translation service,
And here we will need to connect the OpenAI API, and for it, we will create a prompt of the following form "This text {{{translated text}}} format it according to the following scheme {{{numerical description of line formatting}}} where the key is the line number, and the 3 numbers in the value are the left indentation, the number of characters in the line, and the right indentation."
Thus, thanks to ChatGPT, we will be able to closely match the translated version to the original.
I have extensive experience in solving algorithmic problems and unconventional thinking.
I will gladly complete this task and answer all questions.
…
Sincerely,
Georgiy
-
643 5 0 Hello, Serhiy!
My name is Roman, and I have experience in developing similar tools in Python. I am ready to complete your task with maximum attention to detail, ensuring that the text in the documents is translated without disrupting their structure, including formatting, images, tables, and other elements.
I propose the following approach:
1. I will extract the text from PDF or DOCX documents while preserving the entire structure.
2. I will use proven APIs for text translation (Google Translate or others).
…
3. The translated text will be reassembled into a document with the same formatting, without loss of structure.
4. You will receive a fully prepared translated document in the same format.
My experience allows me to guarantee a quality result.
I would be happy to assist you with this project! If you are interested in my proposal, I am ready to get started.
Best regards,
Roman
-
219 1 1 Hello, I can help write this script, I did something similar a year ago using a bot on Python Aiogram.
Message me privately. We will discuss all the details of the work.
-
237 2 1 Hello!
I am ready to take on the work, I would be very grateful if you could provide this to me.
You can take a look at the examples of my work in the portfolio, if needed - we can discuss my experience in more detail in private messages.
-
Не думаю що можна зробити щоб працювало адекватно.
Буде втрачатись форматування тексту.
З pdf буде особливо помітно. Про картинки у документі взагалі мовчу.
-
Можливо, але ви кажете "не думаю", тобто ви не впевненні.
У мене це наскопопом не вийшло але я і сам не впевен, що 100% не реально, часу у мене обмаль тому і пропрацьоввую всі варіанти.
З іншими форматами не вийде. PDF або DOCS і є орігінал.
Від замовника є чітке ТЗ - він хоче софт, в який він закидує документ і отримує документ без витрати додкового часу на форматування. -
Я дякую вам пане Єгору за зворотній зв'язок. Що стосується ціни, там написано - ціна обговорюється, але я не думаю що вам не варто її пропонувати.
Не думаю - це перекладається, як я впевен)
Також, завдяки іншим виконавцям, налаштованим на роботу а не *базар-вокзал*, я вже бачу що це доволі трівіальна задача.
Не треба проецірувати свій досвід як аксіому, бо ми всі можемо помилятися. -
Current freelance projects in the category Python
Creation of a TikTok farm with income generation
602 USD
Looking for a person who can write software for a TikTok farm, so we can generate traffic and earn income. We are seeking a ready-made solution with a full cycle. Python, Bot Development ∙ 1 day 12 hours back ∙ 15 proposals |
AI Commenting Platform for TikTok and Instagram.Project Goal Develop a system that allows managing a large number of TikTok and Instagram accounts and automatically posting relevant comments under selected videos using AI. Main Functionality1. Account Management It is necessary to implement the ability to connect accounts:… AI & Machine Learning, Python ∙ 2 days 20 hours back ∙ 22 proposals |
Build a customer classification model1. There is client data in Mongo/SQL (approximately 20,000 entries with raw data). 2. It is necessary to build features and a classification model of clients into behavioral groups based on this data. 3. The project should be completed in Python. AI & Machine Learning, Python ∙ 4 days 14 hours back ∙ 43 proposals |
IT Automation of VAT Reporting
223 USD
It is necessary to develop a system for automating the transfer of sales data from the CRM to the accounting system Wafeq. The system should import bank and payment reports, automatically reconcile payments with invoices, generate invoices for VAT reporting, and minimize manual… AI & Machine Learning, Python ∙ 4 days 19 hours back ∙ 51 proposals |
Account reconciliation tool with the bank, cards, and accountantTechnical Assignment: Tool for Reconciling Accounts with Bank, Cards, and AccountantGeneral Goal A local tool (script/small application in Python) is needed, which is manually run once every 1-2 months on my computer and performs reconciliation between: Invoices I issued to… Python, Desktop Apps ∙ 5 days 7 hours back ∙ 43 proposals |