Linda Linsefors comments on Vanessa Kosoy’s Shortform

Linda Linsefors 13 Nov 2019 14:09 UTC
LW: 1 AF: 1
AF
I agree that you can assign what ever belief you want (e.g. what ever is useful for the agents decision making proses) for for what happens in the counterfactual when omega is wrong, in decision problems where Omega is assumed to be a perfect predictor. However if you want to generalise to cases where Omega is an imperfect predictor (as you do mention), then I think you will (in general) have to put in the correct reward for Omega being wrong, becasue this is something that might actually be observed.
- Vanessa Kosoy 13 Nov 2019 14:13 UTC
  LW: 2 AF: 1
  AF Parent
  The method should work for imperfect predictors as well. In the simplest case, the agent can model the imperfect predictor as perfect predictor + random noise. So, it definitely knows the correct reward for Omega being wrong. It still believes in Nirvana if “idealized Omega” is wrong.