neverix comments on Is there an ML agent that abandons its utility function out-of-distribution without losing capabilities?

neverix 22 Feb 2023 17:04 UTC
2 points
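This is the whole point of goal misgeneralization: the agent's capabilities carry over out of distribution while the goal it pursues does not. The work on it includes experiments (albeit on toy environments, where the results can be explained by the network finding the wrong algorithm), so I’d say it's quite plausible.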
- Christopher King 22 Feb 2023 17:28 UTC
  1 point
  I guess the answer is yes then! (I think I now remember seeing a video about that.)