Note that there’s a risk of mesa-optimization developing if lookahead improves performance at any point during GPT’s training.
Why?
My thought was that if lookahead improves performance during some period of training, the model is liable to develop mesa-optimization during that period, and then find it useful for other things later on.