the gears to ascension comments on Tests make creating AI hard

the gears to ascension 27 Sep 2017 17:06 UTC
3 points
Overfitting is the same thing as goodhart’s law. If using an inaccurate model helps, it’s because not trying too hard is necessary to avoid goodharting yourself.
- gjm 27 Sep 2017 17:41 UTC
  3 points
  Parent
  That’s … pretty much what I thought I was saying. Was I unclear, or have I misunderstood you somehow?
  - the gears to ascension 27 Sep 2017 18:15 UTC
    1 point
    Parent
    Ah, I thought “in some sense” meant you weren’t sure if you were using the metaphor correctly.
- Scott Garrabrant 3 Oct 2017 0:36 UTC
  1 point
  Parent
  I do not think that overfitting is “the same thing as” Goodhart’s law. I think Goodhart’s law is more broad. One of the mechanisms by which it works is similar to overfitting, but there is a lot more to Goodhart’s law than overfitting. In particular, I think the standard examples of Goodhart’s law include adversaries that are trying to break your proxy in a way that does not show up in overfitting. See also https://agentfoundations.org/item?id=1621