Okay, I guess I’m still confused. So far I’ve loved everything I’ve read on this site and have been able to understand; I’ve appreciated and agreed with the first 110 pages of the Rationality ebook, felt a little skeptical of myself for liking it so completely, and then reassured myself with the Aumann’s agreement theorem it mentions. So I feel like if this utility theorem which bases morality on preferences is commonly accepted around here, I’ll probably like it once I fully understand it. So bear with me as I ask more questions...
Whose preferences am I valuing? Only my own? Everyone’s equally? Those of an “average human”? What about future humans?
Yeah, by subjective, I meant that different humans would care about different things. I’m not really worried about basic morality, like not beating people up and stuff, but...
I have a feeling the hardest part of morality will now be determining where to strike a balance between individual human freedom and concern for the future of humanity.
Like, to what extent is it permissible to harm the environment? If something, like eating sugar for example, makes people dumber, should it be limited? Is population control like China’s a good thing?
Can you really say that most humans agree on where this line between individual freedom and concern for the future of humanity should be drawn? It seems unlikely...
I’m the wrong person to ask about “this utility theorem which bases morality on preferences” since I don’t really subscribe to this point of view.
I use the word “morality” as a synonym for “system of values”, and I think that these values are multiple, somewhat hierarchical, and NOT coherent. Moral decisions are generally made by striking a weighted balance between several conflicting values.
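To make the “weighted balance” idea concrete, here is a minimal toy sketch; the value names, weights, and scores are entirely made up for illustration and aren’t anything proposed in the thread:

```python
# Toy sketch of a "weighted balance between conflicting values".
# All value names, weights, and scores are invented for illustration.

weights = {"individual_freedom": 0.5, "future_welfare": 0.3, "environment": 0.2}

def score(option):
    """Weighted sum of how well an option satisfies each value (scores in [0, 1])."""
    return sum(weights[value] * option.get(value, 0.0) for value in weights)

options = {
    "strict_limits": {"individual_freedom": 0.2, "future_welfare": 0.9, "environment": 0.8},
    "laissez_faire": {"individual_freedom": 0.9, "future_welfare": 0.4, "environment": 0.3},
}

best = max(options, key=lambda name: score(options[name]))  # option with the higher weighted score
```

The point is only that the conflicting values never get merged into one coherent value; they are traded off against each other case by case via the weights.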
By definition, you can only care about your own preferences. That being said, it’s certainly possible for you to have a preference for other people’s preferences to be satisfied, in which case you would be (indirectly) caring about the preferences of others.
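A minimal sketch of what that “indirect caring” could look like, assuming a made-up altruism weight and function name (not anything from the thread): your utility function is still your own, but others’ preference satisfaction enters it as one of the inputs.

```python
# Toy sketch of "indirectly caring" about others: my own utility includes a term
# for how satisfied other people's preferences are. The weight is made up.

def my_utility(own_satisfaction: float, others_satisfaction: float,
               altruism_weight: float = 0.4) -> float:
    """Still *my* utility, but one of its arguments is others' satisfaction."""
    return (1 - altruism_weight) * own_satisfaction + altruism_weight * others_satisfaction

# An outcome mediocre for me but great for others can beat the reverse:
print(my_utility(0.5, 0.9))  # 0.66
print(my_utility(0.9, 0.2))  # 0.62
```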
The question of whether humans all value the same thing is a controversial one. Most Friendly AI theorists believe, however, that the answer is “yes”, at least if you extrapolate their preferences far enough. For more details, take a look at Coherent Extrapolated Volition.
Okay, that makes sense, but does this mean you can’t say someone else did something wrong, unless he was acting inconsistently with his personal preferences?
Ah, okay, I’ve been reading most of the hyperlinks here, but that one looks pretty long, so I’ll come back to it after I finish Rationality (or maybe my question will even be answered later on in the book...)
That is definitely not the idea behind CEV, though it may reflect the idea that a sizable majority will mostly share the same values under extrapolation.
Do they have any arguments for this besides wishful thinking?
I told him “they” assume no such thing—his own link is full of talk about how to deal with disagreements.
Yes, I’ve read most of the arguments, they strike me as highly speculative and hand-wavy.
This is an impressive failure to respond to what I said, which again was that you asked for an explanation of false data. “Most Friendly AI theorists” do not appear to think that extrapolation will bring all human values into agreement, so I don’t know what “arguments” you refer to or even what you think they seek to establish. Certainly the link above has Eliezer assuming the opposite (at least for the purpose of safety-conscious engineering).
ETA: This is the link to the full sub-thread. Note my response to dxu.