I really like the way it handles headlines and bullet point lists!
In an ideal world I’d like the voice to sound less robotic. Something like https://elevenlabs.io/ or https://www.descript.com/overdub. How much I enjoy listening to text-to-speech content depends a lot on how grating I find the voice after long periods of listening.
Thanks! We’re currently using Azure TTS. Our plan is to review every couple months and update to use better voices when they become available on Azure or elsewhere. Elevenlabs is a good candidate but unfortunately they’re ~10x more expensive per hour of narration than Azure ($10 vs $1).
I think the cost per million words measure from the previous version of your comment was also useful to know. Did you replace it because it’s incorrect?
I really like the way it handles headlines and bullet point lists!
In an ideal world I’d like the voice to sound less robotic. Something like https://elevenlabs.io/ or https://www.descript.com/overdub. How much I enjoy listening to text-to-speech content depends a lot on how grating I find the voice after long periods of listening.
Thanks! We’re currently using Azure TTS. Our plan is to review every couple months and update to use better voices when they become available on Azure or elsewhere. Elevenlabs is a good candidate but unfortunately they’re ~10x more expensive per hour of narration than Azure ($10 vs $1).
I think the cost per million words measure from the previous version of your comment was also useful to know. Did you replace it because it’s incorrect?
I replaced it because it seemed like a less useful format.
Azure TTS cost per million characters = $16
Elevenlabs TTS cost per million characters = $180
1 million characters is roughly 200,000 words.
One hour of audio is roughly 9000 words.