Cool! I don’t have time to look into this now, but I’m excited to see what you produce in this direction. As you know I’m pretty pessimistic that we can totally solve Goodhart effects, but I do expect we can mitigate them enough that for things other than superintelligent levels of optimization we can do better than we do now.
Cool! I don’t have time to look into this now, but I’m excited to see what you produce in this direction. As you know I’m pretty pessimistic that we can totally solve Goodhart effects, but I do expect we can mitigate them enough that for things other than superintelligent levels of optimization we can do better than we do now.
Agreed on both points.