Switch to English?
Yes
Переключитись на українську?
Так
Переключиться на русскую?
Да
Przełączyć się na polską?
Tak
High-tech tool for transcribing audio content. Thanks to the integration of modern neural networks, the bot provides studio-quality speech recognition, converting voice messages and audio files into structured text.

Functional advantages:

High accuracy (OpenAI Whisper-1): Speech recognition considering context, punctuation, and various accents even in noisy recordings.

Flexibility of formats: Support for forwarded voice messages (Voice) and uploaded audio files of any duration.

Export of results: The user can receive the text via a message in Telegram or as a ready document in .txt or .docx (Word) format.

Multilingualism: Automatic detection of the speaker's language (Ukrainian, English, and over 50 other languages).

Technical stack:

Core: Python

AI Integration: OpenAI API (Whisper Model)

File Processing: python-docx for document generation, FFmpeg for processing audio streams.

#ffmpeg #python #telebot #Telegram #openai-api #openai #api-development
Work details
Added 6 January
122 views
Freelancer
Vladimir K.
Ukraine Kherson  10  2

Available for hire Available for hire
10 Safes completed
On the service 5 years