Podcast transcription services, probably. They seem to cost around $1 per minute nowadays, and I expect they’ll keep getting disrupted by AI. There are already audio transcription AIs, like the autogenerated subtitles on YouTube, but they get context-dependent ambiguous words wrong. It seems like an obvious idea to plug them into a GPT-style language model that can recognize the topic being talked about and use that to pick the appropriate transcription for homophones.
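To make the idea concrete, here is a rough toy sketch of the rescoring step. Everything in it is made up for illustration: the `HOMOPHONES` table, the `CONTEXT_HINTS` word sets, and the `rescore` function are hypothetical, and simple keyword overlap stands in for what a real language model would score.

```python
from collections import Counter

# Hypothetical toy data: words that sound alike, and topic words that
# tend to appear near each spelling. A real system would ask a language
# model to score each candidate instead of using hand-written hints.
HOMOPHONES = {
    "bass": ["bass", "base"],
    "base": ["bass", "base"],
}
CONTEXT_HINTS = {
    "bass": {"guitar", "drums", "band", "music"},
    "base": {"military", "camp", "army", "baseball"},
}


def rescore(words):
    """Swap each ambiguous word for the sound-alike spelling that best
    matches the rest of the transcript (replaced words are lowercased)."""
    context = Counter(w.lower() for w in words)
    out = []
    for w in words:
        candidates = HOMOPHONES.get(w.lower())
        if not candidates:
            out.append(w)
            continue
        # Put the transcriber's original guess first so it wins ties.
        ordered = sorted(candidates, key=lambda c: c != w.lower())
        # Score = how many transcript tokens appear in the candidate's hints.
        best = max(ordered, key=lambda c: sum(context[h] for h in CONTEXT_HINTS[c]))
        out.append(best)
    return out


print(" ".join(rescore("she plays base guitar in a band".split())))
print(" ".join(rescore("the army moved to a new bass near the camp".split())))
```

With no useful context the transcriber's original guess is kept, so the rescoring step only overrides it when the surrounding words point the other way.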
What’s the best way to turn audio to text?
I’m not super into that, but I’ve heard good things from people about Otter.ai