For a simple task like booking a restaurant, we could just ask the (frozen) overseer-AI to pick[1] actions, no?
The interesting application of MONA seems to be when the myopic RL agent is able to produce better suggestions than the overseer.
Edit: I elaborated
Plus, maybe let the overseer observe the result and say "oops" and roll back that action, if we can implement a rollback in this context.
If it were as simple as “just ask an LLM to choose actions” someone would have deployed this product a while ago.
But in any case, I agree this isn't the most interesting case for MONA; I discussed it because that's what Daniel asked about.