Wei Dai comments on Contest: $1,000 for good questions to ask to an Oracle AI

Wei Dai 5 Jul 2019 0:22 UTC
LW: 2 AF: 1
AF
(I’m still confused and thinking about this, but figure I might as well write this down before someone else does. :)

While thinking more about my submission and counterfactual Oracles in general, this class of ideas for using CO is starting to look like trying to implement supervised learning on top of RL capabilities, because SL seems safer (less prone to manipulation) than RL. Would it ever make sense to do this in reality (instead of just doing SL directly)?