In the case of an FAI, G would be friendliness and G* the friendliness definition. Avoiding a Goodhart’s Law effect on G is the core of the friendliness problem in a nutshell. An example of such a Goodhart’s Law effect would be the molecular smiley faces scenario.
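A toy numerical sketch (my own illustration, not from the thread) of the mechanism: an optimizer that maximizes a measurable proxy G* can end up at a point where the true goal G is destroyed, which is the Goodhart's Law effect being discussed. The specific functions here are arbitrary assumptions chosen only to make the divergence visible.

```python
def true_goal(x):
    """G: the true goal -- peaks at moderate x, then declines."""
    return x * (10 - x)

def proxy(x):
    """G*: a measurable stand-in that keeps rising with x."""
    return x

xs = range(11)
x_proxy = max(xs, key=proxy)       # optimizing the proxy picks x = 10
x_true = max(xs, key=true_goal)    # optimizing the true goal picks x = 5

print(x_proxy, true_goal(x_proxy))  # 10 0  -> the proxy optimum zeroes out G
print(x_true, true_goal(x_true))    # 5 25  -> the true optimum
```

The point of the sketch is that G and G* agree over the ordinary range (both increase from 0 to 5), so the proxy looks fine under weak optimization; it is only under strong optimization pressure that the divergence dominates, as in the molecular smiley faces scenario.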
The point I wanted to make was about extrapolated volition as a strategy for avoiding Goodhart’s Law issues. If you extrapolate the volition of a person toward the “person he/she wants to be” and take the resulting goal as G*, it will be about as close to G as can be. I presented CEV as an example, since the audience is more familiar with it.
And FAWS, your definition of G and G* in the friendliness scenario is perfect. I’ve nothing more to add there.
Ah, sorry. I read the post as saying something different from what it actually says.
Good discussion.