Whisper AI is one of those rare technologies that has almost exhausted my ability to generate feature requests because it pretty much just works, which is a great achievement!
But in the spirit of sharing ideas, and because it's something I thought about in the early days of testing this stuff out, I wanted to put in the idea that supporting a variety of cloud STT (speech-to-text) APIs might be useful for users.
I'm guessing that it would be more hassle than it's worth and it's not really in sync with the Whisper brand (!), but Deepgram's APIs are good, and there are a few others that are useful too.
One use case that I would highlight is that I believe a couple of these more niche platforms have explicit support for speaker accents in their API architecture.
Users who have very pronounced or more unusual accents might find that the recognition is enhanced with these without having to go through the trouble of fine-tuning a model.
Some other providers I've come across: Gladia, Speechmatics.
As ASR is really heating up, I imagine the list will just keep growing.
In Review
Feature Request
About 1 month ago
danielrosehill