Sure, an AI that ignores what you ask and implements some form of CEV or whatever isn’t corrigible. Corrigibility is more about following instructions than about having your utility function.