MIRI makes the methodological proposal that it simplifies the issue of friendliness (or morality, or safety) to deal with the whole of human value rather than to identify a morally relevant subset. Having done that, it concludes that human morality is extremely complex. In other words, the payoff in terms of methodological simplification never arrives, for all that MIRI relieves itself of the burden of coming up with a theory of morality. Since dealing with human value in total is, in absolute terms, very complex, the possibility remains open that identifying the morally relevant subset of values is relatively easier (even if still difficult in absolute terms) than designing an AI to be friendly in terms of the totality of value, particularly since philosophy offers a body of work that seeks to identify simple underlying principles of ethics.
The idea of a tractable, rationally discoverable set of ethical principles is a weaker form of, or a lead-in to, one of the most common objections to the MIRI approach: “Why doesn’t the AI figure out morality itself?”
Thanks, that’s informative. I’m not entirely sure what your own position is from your post, but I agree with what I take your implication to be: that a rationally discoverable set of ethics might not be as sensible a notion as it sounds. But on the other hand, human preference satisfaction seems a really bad goal: many human preferences in the world are awful (take a desire for power over others, for example); otherwise human society wouldn’t have wars, torture, abuse, etc. I haven’t read up on CEV in detail, but from what I’ve seen it suffers from a confusion that decent preferences are somehow gained simply by obtaining enough knowledge? I’m not fully up to speed here so I’m willing to be corrected.
EDIT> Oh… CEV is the main accepted approach at MIRI :-( I assumed it was one of many
that a rationally discoverable set of ethics might not be as sensible a notion as it sounds.
That wasn’t the point I thought I was making. I thought I was making the point that the idea of tractable sets of moral truths had been sidelined rather than sidestepped... that it had been neglected on the basis of a simplification that has not been delivered.
Having said that, I agree that discoverable morality has the potential downside of being inconvenient to, or unfriendly towards, humans: the one true morality might be some deep ecology that required a much lower human population, among many other possibilities. That might have been a better argument against discoverable morality than the one actually presented.
But on the other hand, human preference satisfaction seems a really bad goal: many human preferences in the world are awful (take a desire for power over others, for example); otherwise human society wouldn’t have wars, torture, abuse, etc.
Most people have a preference for not being the victims of war or torture. Maybe something could be worked up from that.
I’ve seen comments to the effect that CEV has been abandoned. The situation is unclear.
Thanks for the reply. That makes more sense to me now. I agree with a fair amount of what you say. I think you’d have a sense from our previous discussions why I favour physicalist approaches to the morals of an FAI, rather than idealist or dualist ones, regardless of whether physicalism is true or false. So I won’t go there. I pretty much agree with the rest.
EDIT> Oh, just on the deep ecology point, I believe that might be solvable by prioritising species based on genetic similarity to humans: so basically weighting humans highest and other species less, based on relatedness. I certainly wouldn’t like to see an FAI adopting the “humans are a disease” view that some people hold, or other such views, so hopefully we can find a way to avoid that sort of thing.
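As a toy sketch of what that kind of relatedness-based weighting could look like (the species list, the relatedness figures, and the falloff exponent here are made-up placeholders, just to show the shape of the idea, not a serious proposal):

```python
# Illustrative only: toy weighting of moral consideration by genetic
# relatedness to humans. The relatedness figures below are rough
# placeholders, not real data, and the falloff exponent is arbitrary.

relatedness_to_humans = {
    "human": 1.00,
    "chimpanzee": 0.98,
    "mouse": 0.85,
    "fruit fly": 0.60,
}

def moral_weight(species: str, falloff: float = 4.0) -> float:
    """Humans get weight 1.0; weight drops steeply as relatedness falls."""
    return relatedness_to_humans[species] ** falloff

for species, r in relatedness_to_humans.items():
    print(f"{species}: relatedness {r:.2f} -> weight {moral_weight(species):.2f}")
```

With these placeholder numbers, humans keep full weight while more distantly related species fall off quickly, which is the rough behaviour the comment describes.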
I think you have an idea from our previous discussions why I don’t think your physicalism, etc., is relevant to ethics.
Indeed I do! :-)
Or simply extremely smart AIs > human minds.
Yes, some humans seem to have adopted this view, where intelligence moves from being a tool with instrumental value to being intrinsically/terminally valuable. I often find the justification for this to be pretty flimsy, though quite a few people seem to hold it. Let’s hope an AGI doesn’t, lol.