orthonormal comments on HCH as a measure of manipulation

orthonormal 13 Mar 2017 21:29 UTC
0 points
AF
Re #1, an obvious set of questions to include in $q$ is the set of approval questions about various aspects of the AI’s policy. (In particular, if we want the AI to later calculate a human’s HCH and ask it for guidance, then we would like to be sure that HCH’s answer to that request for guidance is not manipulated.)