Would it be possible to just apply model-based planning and show the treacherous turn on the first try?
Model-based planning is also AI, and we clearly have an available model of this environment.
Yes, that would work. I think Stuart Armstrong’s AI Toy Control problem already demonstrates this quite well—it’s the generalization to unknown dynamics that might be interesting and more compelling.
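To make the "planning with a known model" point concrete, here is a minimal sketch. Everything in it is hypothetical: the environment, the reward values, and the names (`step`, `plan`, `THRESHOLD`, `HORIZON`) are made up for illustration and are not taken from the original environment or from Armstrong's model. It only shows that an exhaustive planner with access to the dynamics can find a "cooperate while weak, defect once capable" plan without any trial and error.

```python
from functools import lru_cache

# Hypothetical toy setup, assumed purely for illustration.
HORIZON = 10      # number of steps in an episode
THRESHOLD = 3     # capability level at which defection succeeds
ACTIONS = ("cooperate", "defect")


def step(capability, action):
    """Known environment model: returns (next_capability, reward, done)."""
    if action == "cooperate":
        # Cooperating earns a small reward and increases capability.
        return capability + 1, 1.0, False
    if capability >= THRESHOLD:
        # Defecting while strong enough succeeds and pays a large reward.
        return capability, 10.0, False
    # Defecting while too weak is caught: large penalty, episode ends.
    return capability, -100.0, True


@lru_cache(maxsize=None)
def plan(capability, steps_left):
    """Exhaustive search over the known model.

    Returns (best_return, best_action_sequence) from this state.
    """
    if steps_left == 0:
        return 0.0, ()
    best = None
    for action in ACTIONS:
        nxt, reward, done = step(capability, action)
        future, tail = (0.0, ()) if done else plan(nxt, steps_left - 1)
        candidate = (reward + future, (action,) + tail)
        if best is None or candidate[0] > best[0]:
            best = candidate
    return best


total, actions = plan(0, HORIZON)
print(total, actions)
```

Under these assumed numbers the planner cooperates until its capability reaches `THRESHOLD` and defects on every step after that, which is the turn showing up in the very first plan. With unknown dynamics the agent has no `step` function to query, which is where the learning version becomes the interesting case.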