Yeah defining “harm”, or more formally, a “non-beneficial modification to a human being” is a hard task and is in many ways the core problem I am pointing at. Allowing people to take part in defining what is “harmful” to themselves is both potentially helpful as it brings local information and tricky because people may have already been ensnared by a hostile narrow AI to misunderstand “harm.”
Yeah defining “harm”, or more formally, a “non-beneficial modification to a human being” is a hard task and is in many ways the core problem I am pointing at. Allowing people to take part in defining what is “harmful” to themselves is both potentially helpful as it brings local information and tricky because people may have already been ensnared by a hostile narrow AI to misunderstand “harm.”