I would hestitate to call it a solution because my motivation is in proving that partially optimized models are “better than nothing” in some sense, and transforming prediction problems into RL problems sounds like something that makes proofs much harder, rather than easier. But I agree that it could mitigate the problems that arise in practice, even if it doesn’t solve the theory.
I would hestitate to call it a solution because my motivation is in proving that partially optimized models are “better than nothing” in some sense, and transforming prediction problems into RL problems sounds like something that makes proofs much harder, rather than easier. But I agree that it could mitigate the problems that arise in practice, even if it doesn’t solve the theory.