As an active user of your Chrome extension, I'd like to suggest an improvement that would significantly enhance transcription accuracy.
I've been using the extension for audio transcription for over a month. When working with Russian language, numerous errors occur. This requires extensive manual corrections, reducing work efficiency.
OpenAI has released a new transcription model that substantially outperforms the current Whisper: https://openai.com/index/introducing-our-next-generation-audio-models/
Significantly improved accuracy (reduced Word Error Rate), tested on more than 100 languages
Better recognition of accents and regional speech patterns
Increased resilience to background noise during recording
Adaptation to varying speech speeds
Reduction of incorrect interpretations for complex words
Better context understanding and recognition of specific terminology
If the full GPT-4o-transcribe version proves too expensive, you could implement gpt-4o-mini-transcribe. Even this mini version significantly outperforms the current Whisper model in terms of accuracy and reliability.
This enhancement requires minimal effort β simply changing the model name used in the API request.
Please authenticate to join the conversation.
Completed
Feature Request / Bug Report
11 months ago

Web
Get notified by email when there are changes.
Completed
Feature Request / Bug Report
11 months ago

Web
Get notified by email when there are changes.