Support for Deepgram, other STT APIs

Whisper AI is one of those rare technologies that has almost exhausted my ability to generate feature requests because it pretty much just works, which is a great achievement!

But in the spirit of sharing ideas and because it's something that I thought about in the early days of testing this stuff out, I wanted to put in the idea that supporting a variety of SGT Cloud APIs might be useful for users.

I'm guessing that it would be more hassle than it's worth and it's not really in sync with the Whisper brand (!), but DeepGram's APIs are good, and there are a few others that are useful too.

One use case that I would highlight is that I believe a couple of these more niche platforms have explicit support for speaker accents in their API architecture.

Users who have very pronounced or more unusual accents might find that the recognition is enhanced with these without having to go through the trouble of fine-tuning a model.

Some other providers I’ve come across: Gladia, Speechmatics.

As ASR is really heating up, I imagine the list will just keep growing.

Please authenticate to join the conversation.

Upvoters
Status

In Review

Board
πŸ’‘

Feature Request

Date

About 1 month ago

Author

danielrosehill

Subscribe to post

Get notified by email when there are changes.