Simulate an argument between the individual at State 1 and State 7. If the individual at State 1 is ultimately convinced
Does this include “convinced by hypnosis”, “convinced by brainwashing”, “convinced by a clever manipulation” etc.? How will AI tell the difference?
(Maybe “convincing by hypnosis” is considered a standard and ethical method of communication with lesser beings in the society of Stage 7. If a person A is provably more intelligent and rational than a person B, and a person A acts according to generally accepted ethical values, why not make the communication most efficient? To do otherwise would be a waste of resources, which is a crime, if the wasted resources could be instead spent on saving people’s lives.)
under the assumption that the individual at State 1 and the individual at State 7 both perfectly follow their own rules for proper argument
On point 1- OK, I screwed up slightly. Neither individual is allowed to argue with the other in a manner which the other one would see as brainwashing or unfair manipulation if in possesion of all the facts. The system rules out anything deceptive by correcting both parties on anything that is a question of fact.
On point 2- Then they both argue using their own rules of argument. Presumably, the individual at State 1 is unconvinced.
Neither individual is allowed to argue with the other in a manner which the other one would see as brainwashing or unfair manipulation if in possesion of all the facts.
Presumably this means “all the morally relevant facts,” since giving State 1 “all the facts” would be isomorphic to presenting him with the argument-simulation. But determining all the morally relevant facts is a big part of the problem statement. If the AI could determine which aspects of which actions were morally relevant, and to what the degree and sign of that moral valence was, it wouldn’t need CEV.
We could lock down the argument more, just to be safe.
I’m not sure whether a text-only channel between State 1 and State 7, allowing only if-then type statements with a probability attached, would allow brainwashing or hypnosis. But I’m also not sure how many State 1 racists would be convinced that racism is unethical, over such a channel.
How about the individual versions at State 1 and State 7 both get all the facts that they consider relevant themselves? And maybe a State 1 racist really wouldn’t have CEV towards non-racism- we just have to accept that.
Does this include “convinced by hypnosis”, “convinced by brainwashing”, “convinced by a clever manipulation” etc.? How will AI tell the difference?
(Maybe “convincing by hypnosis” is considered a standard and ethical method of communication with lesser beings in the society of Stage 7. If a person A is provably more intelligent and rational than a person B, and a person A acts according to generally accepted ethical values, why not make the communication most efficient? To do otherwise would be a waste of resources, which is a crime, if the wasted resources could be instead spent on saving people’s lives.)
What if the rules are incompatible?
On point 1- OK, I screwed up slightly. Neither individual is allowed to argue with the other in a manner which the other one would see as brainwashing or unfair manipulation if in possesion of all the facts. The system rules out anything deceptive by correcting both parties on anything that is a question of fact. On point 2- Then they both argue using their own rules of argument. Presumably, the individual at State 1 is unconvinced.
Presumably this means “all the morally relevant facts,” since giving State 1 “all the facts” would be isomorphic to presenting him with the argument-simulation. But determining all the morally relevant facts is a big part of the problem statement. If the AI could determine which aspects of which actions were morally relevant, and to what the degree and sign of that moral valence was, it wouldn’t need CEV.
We could lock down the argument more, just to be safe.
I’m not sure whether a text-only channel between State 1 and State 7, allowing only if-then type statements with a probability attached, would allow brainwashing or hypnosis. But I’m also not sure how many State 1 racists would be convinced that racism is unethical, over such a channel.
How about the individual versions at State 1 and State 7 both get all the facts that they consider relevant themselves? And maybe a State 1 racist really wouldn’t have CEV towards non-racism- we just have to accept that.