AI solution for translation
Technical assignment for the development of a service for automatic translation of YouTube videos from English to Russian using AI
1. General information
Goal: Development of a service that automatically downloads videos from YouTube, recognizes the English speech, translates it into Russian, and produces subtitles or a voiceover.
Main requirements: High processing speed, translation accuracy, process automation.
2. Functional requirements
2.1. Main functions
1. Video upload
• User input of the YouTube video link
• Ability to upload video files from PC
• Duration limit for videos (optional)
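A minimal sketch of how the link input and the optional duration limit might be validated, using only the Python standard library. The function names, the accepted URL forms, and the 30-minute limit are assumptions made for illustration; the assignment does not fix any of them.

```python
from typing import Optional
from urllib.parse import parse_qs, urlparse

# Hypothetical limit: the assignment lists the duration limit as optional.
MAX_DURATION_SECONDS = 30 * 60


def extract_video_id(url: str) -> Optional[str]:
    """Return the video ID from a few common YouTube URL forms, or None."""
    parsed = urlparse(url.strip())
    host = (parsed.hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]

    candidate = ""
    if host == "youtu.be":
        # Short links: https://youtu.be/<id>
        candidate = parsed.path.lstrip("/").split("/")[0]
    elif host in ("youtube.com", "m.youtube.com"):
        if parsed.path == "/watch":
            # Standard links: https://www.youtube.com/watch?v=<id>
            candidate = parse_qs(parsed.query).get("v", [""])[0]
        elif parsed.path.startswith(("/shorts/", "/embed/")):
            candidate = parsed.path.split("/")[2]
    return candidate or None


def is_within_limit(duration_seconds: float,
                    limit: float = MAX_DURATION_SECONDS) -> bool:
    """Check a video duration against the (assumed) service limit."""
    return 0 < duration_seconds <= limit
```

A link that yields `None` can be rejected before any download is attempted, which keeps invalid input out of the processing queue.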
2. Speech recognition (Speech-to-Text)
• Automatic extraction of the audio track from the video
• Recognition of English speech using ASR models (e.g., OpenAI Whisper, Deepgram, Vosk)
• Splitting the recognized text into timestamped segments (timecodes)
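The audio extraction step could be sketched as a thin wrapper around the FFmpeg command line. The function names and the choice of 16 kHz mono WAV are assumptions for illustration (it is a common input format for ASR models); the ASR call itself is left out because it depends on which engine is chosen.

```python
import subprocess
from typing import List


def build_extract_audio_cmd(video_path: str, wav_path: str,
                            sample_rate: int = 16000) -> List[str]:
    """Build an ffmpeg command that writes the audio track as mono PCM WAV."""
    return [
        "ffmpeg",
        "-y",                     # overwrite the output file if it exists
        "-i", str(video_path),    # input video
        "-vn",                    # drop the video stream
        "-ac", "1",               # downmix to mono
        "-ar", str(sample_rate),  # resample (16 kHz is an assumed default)
        "-c:a", "pcm_s16le",      # 16-bit PCM
        str(wav_path),
    ]


def extract_audio(video_path: str, wav_path: str) -> None:
    """Run ffmpeg; raises CalledProcessError if extraction fails."""
    subprocess.run(build_extract_audio_cmd(video_path, wav_path), check=True)
```

Building the command as a list (rather than a shell string) avoids quoting problems with file names that contain spaces.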
3. Translation (Machine Translation)
• Translation of text from English to Russian
• Use of neural network models (e.g., DeepL API, OpenAI GPT, Google Translate API)
• Preservation of timestamps during translation
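One way to preserve timestamps is to translate segment by segment and copy the timing fields unchanged. The sketch below assumes a hypothetical `Segment` structure and takes the translation backend as a plain callable, so that DeepL, GPT, or Google Translate could be plugged in behind the same interface; none of these names come from the assignment.

```python
from dataclasses import dataclass, replace
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Segment:
    """Hypothetical container for one recognized phrase."""
    start: float  # seconds from the beginning of the video
    end: float
    text: str


def translate_segments(segments: Iterable[Segment],
                       translate: Callable[[str], str]) -> List[Segment]:
    """Translate each segment's text while keeping start/end untouched."""
    return [replace(seg, text=translate(seg.text)) for seg in segments]


# Usage with a stand-in translator (a real backend would call an external API).
fake_dictionary = {"Hello": "Привет", "Thank you": "Спасибо"}
source = [Segment(0.0, 1.2, "Hello"), Segment(1.5, 2.8, "Thank you")]
translated = translate_segments(source, lambda t: fake_dictionary.get(t, t))
```

Translating short segments in isolation can lose context between sentences, so a production version might send neighbouring segments together and split the result afterwards; that trade-off is outside this sketch.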
4. Subtitle generation
• Formation of subtitle files (.srt, .vtt)
• Ability to edit the translation before final export
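For reference, a small sketch of how timestamped segments might be serialized to the .srt format: a running index, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` timing line, the text, and a blank line between entries. The function names and the tuple layout `(start, end, text)` are assumptions for illustration.

```python
from typing import Iterable, Tuple


def format_srt_timestamp(seconds: float) -> str:
    """Convert seconds to the SRT form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: Iterable[Tuple[float, float, str]]) -> str:
    """Render (start, end, text) tuples as the contents of an .srt file."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        timing = f"{format_srt_timestamp(start)} --> {format_srt_timestamp(end)}"
        blocks.append(f"{index}\n{timing}\n{text}\n")
    return "\n".join(blocks)
```

A .vtt export differs mainly in the leading `WEBVTT` header line and the use of a period instead of a comma before the milliseconds.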
5. Voiceover of the translation (Text-to-Speech, TTS)
• Generation of voiceover for the translated text synchronized with the video
• Use of TTS engines (e.g., ElevenLabs, Microsoft Azure TTS, Google WaveNet)
• Adjustment of voice speed and intonation
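Synchronization usually comes down to fitting each synthesized phrase into the time slot of the original phrase. A possible sketch of the speed calculation is shown below; the function name and the clamping range of 0.8 to 1.5 are assumptions chosen for illustration, not values from the assignment.

```python
def speed_factor(tts_duration: float, slot_duration: float,
                 min_factor: float = 0.8, max_factor: float = 1.5) -> float:
    """Return the playback-speed multiplier that fits TTS audio into a slot.

    A factor above 1.0 speeds the voice up, below 1.0 slows it down.
    The result is clamped so the speech stays intelligible (assumed bounds).
    """
    if tts_duration <= 0 or slot_duration <= 0:
        raise ValueError("durations must be positive")
    raw = tts_duration / slot_duration
    return max(min_factor, min(max_factor, raw))
```

When the factor hits the upper bound, the phrase still overruns its slot, so the service would need a fallback such as shortening the translation or letting the phrase extend into a following pause. FFmpeg's `atempo` audio filter is one way to apply such a factor to the generated audio.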
6. Export of results
• Saving the translated video with the voiceover overlaid
• Downloading subtitles separately (.srt, .vtt)
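The export step could be sketched as an FFmpeg call that keeps the original video stream and attaches the generated voiceover as the audio track. This sketch replaces the original audio entirely, which is an assumption: the assignment does not say whether the English track should be removed or mixed in quietly underneath. The function name is hypothetical.

```python
from typing import List


def build_mux_cmd(video_path: str, voiceover_path: str,
                  output_path: str) -> List[str]:
    """Build an ffmpeg command that swaps the audio track for the voiceover."""
    return [
        "ffmpeg",
        "-y",                   # overwrite the output file if it exists
        "-i", str(video_path),  # input 0: original video
        "-i", str(voiceover_path),  # input 1: generated Russian voiceover
        "-map", "0:v:0",        # take video from the first input
        "-map", "1:a:0",        # take audio from the second input
        "-c:v", "copy",         # do not re-encode video (fast, lossless)
        "-c:a", "aac",          # encode the voiceover for an MP4 container
        "-shortest",            # stop at the end of the shorter stream
        str(output_path),
    ]
```

The command list can be passed to `subprocess.run(cmd, check=True)`. Copying the video stream instead of re-encoding it keeps this step short relative to recognition and synthesis.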
3. Non-functional requirements
3.1. Performance
• Processing a 10-minute video in no more than 5-10 minutes
• Speech recognition accuracy of at least 85-90%
• Translation correctness of at least 90% (when using neural network models)
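The assignment does not say how recognition accuracy is measured. One common choice is word error rate (WER), the word-level edit distance divided by the number of reference words, with accuracy taken as roughly 1 − WER, so 85-90% accuracy would correspond to a WER of about 10-15%. The sketch below assumes that interpretation; the function name and the simple lowercase/whitespace normalization are illustrative.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the processed reference prefix
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing in hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Usage: one substituted word out of six reference words.
wer = word_error_rate("the cat sat on the mat", "the cat sat on a mat")
accuracy = 1.0 - wer
```

Measuring this requires a manually verified reference transcript for a test set of videos, so the acceptance criteria would need to name that test set.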
3.2. Technology stack
• Backend: Python (FastAPI, Django)
• ASR: OpenAI Whisper / Deepgram / Vosk
• Machine Translation: OpenAI GPT-4 / DeepL API / Google Translate API
• TTS: ElevenLabs / Microsoft Azure TTS / Google WaveNet
• Video processing: FFmpeg
• Database: PostgreSQL / MongoDB
• Frontend: React / Vue.js
3.3. Integrations
• YouTube API (for video uploads)
• Cloud services for AI models (OpenAI, DeepL, Google Cloud)
4. Interface requirements
1. Ease of use – intuitive interface with minimal steps for the user.
2. Personal account – history of processed videos, ability to re-download.
3. Settings – choice of translation quality (basic / advanced), choice of voice for voiceover.
Expected result: A working service with high translation accuracy, fast processing, and a user-friendly interface.
Send your proposal with price, deadlines, and examples of AI projects you have completed by private message immediately.
Do not send overpriced solutions.
-
1 day, 269 USD
Unfortunately, you have disabled the ability to message you privately. Therefore, I hope you forgive my audacity to make a bid live. I am ready to complete your long-term and large-scale project in the shortest possible time at the lowest price, creating a user-friendly and intuitive interface, with rich settings, processing video in 10 minutes, with high accuracy of translation. Let's integrate all the achievements of humanity over thousands of years in the field of AI into your worthy project together!
-
14 days, 269 USD
Hello! I have experience in integrating AI solutions, as well as developing my own based on TensorFlow and PyTorch. Let's discuss the price and deadlines.
-
$250 - really?
-
I apologize, but no such bot exists. The pieces exist separately, and there is an AI that does this, but it works poorly.