I can see why the reinforcement learning agent and the prediction agent would want to use a delusion box, but I don’t see why the goal-maximizing agent would want one… maybe I should go look at the paper.