Audio transcription with authors breakdown and multi-modular verification
We are looking for a Python developer with experience working with API for speech recognition and machine learning to upgrade the existing transcription script.The project aims to create an advanced system that optimizes the process of transcription of audio files with breakdown on the authors of the conversation.The system will use several speaking recognition services (Google Cloud Speech-to-Text, Whisper, Microsoft Azure Speech), as well as machine learning to improve the quality and accuracy of transcription.The main objectives of the project:
1 .** Modification of the existing script:**
- Integrate the API of various speaking recognition services: Google Cloud Speech-to-Text, Whisper, and Microsoft Azure Speech to improve the quality of recognition.(There is a whisper, other models are needed)
Develop the logic of choosing the best service for a specific audio fragment based on quality and value.2nd*Development of the mechanism of long fragments:**
Automatic breakdown of the audio into long and short fragments.(it is already there)
Processing long fragments using the advanced capabilities of selected recognition services.Three**GPT integration and optimization for context analysis:**
Integration of the GPT model to verify and improve the quality of transcription by analyzing the context of conversation.(it isExamples for the test)
Development of processing algorithms returned by the GPT model of conclusions for the correction and supplementation of the received transcription.4 .* Testing and validation of the system:**
A comprehensive testing of the system on different types of audio materials.Analysis of the accuracy, speed and cost of transcriptions obtained using different services and algorithms.and 5.* Developing the user interface:**
Create a simple and intuitively understandable interface to run the script and view the results.( to be discussed )
Requirements for qualification:
Knowledge of Python programming language and experience of at least 3 years.Experience with Speech Recognition API (Google Cloud Speech-to-Text, Whisper, Microsoft Azure Speech) and other cloud services.Experience in machine learning, especially with natural language processing models such as GPT-3.Understanding the principles of processing and analysis of audio data.Capacity to analyze and solve complex tasks.Attention to the details and a desire for a high quality of the work.## The expected results:
At the end of the project, the developer must provide a finished system capable of:
Automatically break the audio into fragments and determine the authors of the speech.Optimally allocate fragments between different recognition services to get the best results.Use GPT to analyze context and improve transcription accuracy.Reporting the quality and cost of the transcription process.Budget and deadlines:
The budget of the project and the deadlines for its implementation will be agreed with the developer after a detailed discussion of the work volume and the assessment of the time for the implementation of all functions.## The selection process:
1 .Examination of portfolio and experience of working with similar tasks.2ndInterview to discuss the details of the project and the possibility of implementation of the intended.ThreeDiscussing the conditions of cooperation and signing the contract.
Current freelance projects in the category AI & Machine Learning
Build a customer classification model1. There is client data in Mongo/SQL (approximately 20,000 entries with raw data). 2. It is necessary to build features and a classification model of clients into behavioral groups based on this data. 3. The project should be completed in Python. AI & Machine Learning, Python ∙ 2 hours 32 minutes back ∙ 14 proposals |
Integration of dental scanner modules into CRM
601 USD
We have developed a CRM system for interaction with dentists and laboratories. It is necessary to integrate services like iTero, Sirona, Medit, and others so that files are pulled automatically. AI & Machine Learning, Java ∙ 4 hours 18 minutes back ∙ 11 proposals |
Create a team of AI agentsI want to create a team of AI agents that will help in everyday life, control business processes, analyze reports, etc. AI & Machine Learning ∙ 6 hours 39 minutes back ∙ 16 proposals |
IT Automation of VAT Reporting
223 USD
It is necessary to develop a system for automating the transfer of sales data from the CRM to the accounting system Wafeq. The system should import bank and payment reports, automatically reconcile payments with invoices, generate invoices for VAT reporting, and minimize manual… AI & Machine Learning, Python ∙ 7 hours 57 minutes back ∙ 28 proposals |
Development of a sales AI agent for an online store on PrestaShop 1.6 with KeyCRM integrationWe are looking for a developer or a small team to create an AI sales consultant for an online store of educational literature. The site runs on PrestaShop 1.6, CRM — KeyCRM. We need not an ordinary chatbot with ready-made answers, but an AI seller that will help the customer… AI & Machine Learning, Online Stores & E-commerce ∙ 13 hours 12 minutes back ∙ 34 proposals |