Stuart_Armstrong comments on The mathematics of reduced impact: help needed

Stuart_Armstrong 22 Feb 2012 9:54 UTC

Beware Goodhart’s Law:

One consideration is the amount of information in the coarse-graining measures: we could set things up so that more measurements are made than there are bits in the disciple AI’s source code. That is not a guarantee of anything, of course, but Goodhart’s law mainly derives from how short the success indicator is compared with the phenomenon it’s trying to measure, which makes subverting the indicator easier than improving the phenomenon.
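
To make the counting intuition concrete, here is a toy sketch (not a model of the actual reduced-impact setup). It assumes the "phenomenon" is an n-bit string with a single desired target state, and the "indicator" checks only the first k bits. The names `passing_states`, `n_bits` and `k_measured` are hypothetical, chosen just for this illustration.

```python
from itertools import product


def passing_states(n_bits, k_measured):
    """Count n-bit states that agree with the target on the first k_measured bits.

    Assumption for illustration: the target is the all-ones string, and the
    indicator only ever looks at the first k_measured bits.
    """
    if not 0 <= k_measured <= n_bits:
        raise ValueError("k_measured must be between 0 and n_bits")
    target = (1,) * n_bits
    return sum(
        1
        for state in product((0, 1), repeat=n_bits)
        if state[:k_measured] == target[:k_measured]
    )


if __name__ == "__main__":
    # The more bits the indicator checks, the fewer off-target states pass it.
    for k in (1, 4, 8):
        print(k, passing_states(8, k))
```

With 8 bits, an indicator that checks 1 bit is satisfied by 128 states, one that checks 4 bits by 16, and one that checks all 8 by only the target itself. A short indicator leaves a lot of room for states that pass without being the intended one; a long indicator shrinks that room, though as said above this is no guarantee of anything.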