TurnTrout comments on Has anybody used quantification over utility functions to define “how good a model is”?

TurnTrout 2 Feb 2021 20:52 UTC
3 points
For sufficiently rich Z, that means that the summary must include a full model of the environment.
Is this a thoerem you’ve proven somewhere?
- johnswentworth 2 Feb 2021 22:54 UTC
  3 points
  Parent
  I have it in a notebook, might make a post soonish.
  - TurnTrout 3 Feb 2021 2:06 UTC
    5 points
    Parent
    I ask because I already have a result that says this in MDPs: you can compute all optimal value functions iff you know the environment dynamics up to isomorphism.
  - Eigil Rischel 11 Feb 2021 13:59 UTC
    3 points
    Parent
    (John made a post, I’ll just post this here so others can find it: https://www.lesswrong.com/posts/Dx9LoqsEh3gHNJMDk/fixing-the-good-regulator-theorem)