Good point.
(That said, it seems like to useful check to see what the optimal policy will do. And if someone believes it won’t achieve the optimal policy, it seems useful to try to understand the barrier that stops that. I don’t feel quite clear on this yet).
Good point.
(That said, it seems like to useful check to see what the optimal policy will do. And if someone believes it won’t achieve the optimal policy, it seems useful to try to understand the barrier that stops that. I don’t feel quite clear on this yet).