I strongly suspect those meta-preferences are both critical for correct extrapolation of human values/preferences, AND are the place where we’ll find a fair bit of actual inconsistency of human desires.
“I want to be able to follow my illegible whims” seems like a very common and strong meta-preference, and I haven’t seen it modeled well in any discussions.
I strongly suspect those meta-preferences are both critical for correct extrapolation of human values/preferences, AND are the place where we’ll find a fair bit of actual inconsistency of human desires.
“I want to be able to follow my illegible whims” seems like a very common and strong meta-preference, and I haven’t seen it modeled well in any discussions.