One consideration is the amount of information in the coarse graining measures: we could set it up so there are more measurements made than there are bits in the disciple AI’s source code. Not a guarantee of anything, of course, but Goodhart’s law mainly derives from how short the success indicator is compared with the phenomena it’s trying to measure, so hence subverting the law is easier than improving the phenomena.
One consideration is the amount of information in the coarse graining measures: we could set it up so there are more measurements made than there are bits in the disciple AI’s source code. Not a guarantee of anything, of course, but Goodhart’s law mainly derives from how short the success indicator is compared with the phenomena it’s trying to measure, so hence subverting the law is easier than improving the phenomena.