There are a number of voice taking apps and productivity tools on the market whose secret sauce is essentially applying a system prompt on top of a dictated text passed through Whisper.
From what I've seen, this actually isn't really that hard to do. I use a custom AI frontend and have created dozens of fairly simple system prompts to do everything from converting text into to-do list format through to making it more professional, making it more concise, etc.
I know that the current feature set is focused really on dictation and this might be overstepping the boundary into productivity tools, but on the other hand it may actually make sense and save people from requiring multiple components to do this very useful and everyday task.
I shared a library of text transformation system prompts on Hugging Face yesterday, which of course are totally open source, and if the idea ever sounds appealing then you are free to use it.
https://huggingface.co/datasets/danielrosehill/text-transformation-system-prompts
Please authenticate to join the conversation.
In Review
Feature Request
About 1 month ago
Daniel Rosehill
Get notified by email when there are changes.
In Review
Feature Request
About 1 month ago
Daniel Rosehill
Get notified by email when there are changes.