What makes a utility function “true”? If I choose to literally wirehead—implant electrodes—I can sign a statement saying I consider my “true” utility function to be optimized by wireheading. Does that mean I’m not wireheading in your sense?
What makes a utility function “true”? If I choose to literally wirehead—implant electrodes—I can sign a statement saying I consider my “true” utility function to be optimized by wireheading. Does that mean I’m not wireheading in your sense?