Yes OpenAI realtime API is really cool. When speaking to realtime API, I start each sentence with two words indicating what I want it to do. It’s clunky but it works. “Translate Chinese, what is the time?” “Reply Chinese, how are you?” Ideally yes I could write an app to prepend the instruction audio to each sentence.
If I had this as higher priority I’d actually want to setup this Twilio app.
Thanks for taking time to reply!
Yes OpenAI realtime API is really cool. When speaking to realtime API, I start each sentence with two words indicating what I want it to do. It’s clunky but it works. “Translate Chinese, what is the time?” “Reply Chinese, how are you?” Ideally yes I could write an app to prepend the instruction audio to each sentence.
If I had this as higher priority I’d actually want to setup this Twilio app.