Say, the part of the AI that outputs speech to a microphone—what ring is that in?
I am not a professional software designer, so take all this with a grain of salt. That said, hardware I/O is ring 1, so the part that outputs speech to a speaker would be ring 1, while an off-the-shelf ‘text to speech’ app could run in ring 3. No part of a well-designed agent would output anything to an input device, such as a microphone.
That’s less of a rephrasing and more of a relocating the goalposts across state lines. “Choosing what to say,” properly unpacked, is approximately every part of the AI that doesn’t already exist.
I am not a professional software designer, so take all this with a grain of salt. That said, hardware I/O is ring 1, so the part that outputs speech to a speaker would be ring 1, while an off-the-shelf ‘text to speech’ app could run in ring 3. No part of a well-designed agent would output anything to an input device, such as a microphone.
Let me rephrase. The part of the agent that chooses what to say to the user—what ring is that in?
That’s less of a rephrasing and more of a relocating the goalposts across state lines. “Choosing what to say,” properly unpacked, is approximately every part of the AI that doesn’t already exist.
Yes. That’s the problem with the ring architecture.
As opposed to a problem with having a massive black box labeled “decisionmaking” in your AI plans, and not knowing how to break it down into subgoals?