I like this breakdown! But I have one fairly big asterisk — so big, in fact, that I wonder if I’m misunderstanding you completely.
Care-morality mainly makes sense as an attitude towards agents who are much less capable than you—for example animals, future people, and people who aren’t able to effectively make decisions for themselves.
I’m not sure animals belong on that list, and I’m very sure that future people don’t. I don’t see why it should be more natural to care about future humans’ happiness than about their preferences/agency (unless, of course, one decides to be that breed of utilitarian across the board, for present-day people as well as future ones).
Indeed, the fact that one of the futures we want to avoid is one in which future humans lose all control over their destiny and are instead wireheaded to one degree or another by a misaligned A.I. handily demonstrates that we don’t think about future people in those terms at all, but in fact generally value their freedom and ability to pursue their own preferences, just as we do our contemporaries’.
(As I said, I also disagree with taking this approach for animals. I believe that insofar as animals have intelligible preferences, we should try to follow those, not perform naive raw-utility calculations — so that e.g. the question is not whether a creature’s life is “worth living” in terms of a naive pleasure/pain ratio, but whether the animal itself seems to desire to exist. That being said, I do know a nonzero number of people in this community have differing intuitions on this specific question, so it’s probably fair game to include in your descriptive breakdown.)
I assume that you do think it makes sense to care about the welfare of animals and future people, and you’re just questioning why we shouldn’t care more about their agency?
The reductio for caring more about animals’ agency is when they’re in environments where they’ll very obviously make bad decisions—e.g. there are lots of things which are poisonous that they don’t know about; there are lots of cars that would kill them, but they keep running onto the road anyway; etc. (The more general principle is that the preferences of dumb agents aren’t necessarily well-defined from the perspective of smart agents, who can elicit very different preferences by changing the inputs slightly.)
The reductio for caring more about future people’s agency is in cases where you can just choose their preferences for them. If the main thing you care about is their ability to fulfil their preferences, then you can just make sure that only people with easily-satisfied preferences (like: the preference that grass is green) come into existence.
The other issue I have with focusing primarily on agency is that, as we think about creatures which are increasingly different from humans, my intuitions about why I care about their agency start to fade away. If I think about a universe full of paperclip maximizers with very high agency… I’m just not feeling it. Whereas at least if it’s a universe full of very happy paperclip maximizers, that feels more compelling.
(I do care somewhat about future people’s agency; and I personally define welfare in a way which includes some component of agency, such that wireheading isn’t maximum-welfare. But I don’t think it should be the main thing.)
(Also, as I wrote this comment, I realized that the phrasing in the original sentence you quoted is infelicitous, and so will edit it now.)
Thank you! This is helpful. I’ll start with the bit where I still disagree and/or am still confused, which is the future people. You write:
The reductio for caring more about future people’s agency is in cases where you can just choose their preferences for them. If the main thing you care about is their ability to fulfil their preferences, then you can just make sure that only people with easily-satisfied preferences (like: the preference that grass is green) come into existence.
Sure. But also, if the main thing you care about is their ability to be happy, you can just make sure that only people whom green grass sends to the heights of ecstasy come into existence? This reasoning seems like it proves too much.
I’d guess that your reply is going to involve your kludgier, non-wireheading-friendly idea of “welfare”. And that’s fair enough in terms of handling this kind of dilemma in the real world; but running with a definition of “welfare” that smuggles in that we also care about agency a bit… seems, to me, like it muddles the original point of wanting to cleanly separate the three “primary colours” of morality.
That aside:
Re: animals, I think most of our disagreement just dissolves into semantics. (Yay!) IMO, keeping animals away from situations which they don’t realize would kill them just falls under the umbrella of using our superior knowledge/technology to help them fulfill their own extrapolated preference to not-get-run-over-by-a-car. In your map this is probably taken care of by your including some component of agency in “welfare”, so it all works out.
Re: caring about paperclip maximizers: intuitively, I care about creatures’ agency iff they’re conscious/sentient, and I care more if they have feelings and emotions I can grok. So, I care a little about the paperclip maximizers getting to maximize paperclips to their heart’s content if I am assured that they are conscious; and I care a bit more if I am assured that they feel what I would recognise as joy and sadness based on the current number of paperclips. I care not at all otherwise.
If I think about a universe full of paperclip maximizers with very high agency… I’m just not feeling it. Whereas at least if it’s a universe full of very happy paperclip maximizers, that feels more compelling.
This is really the old utilitarian argument that we value things (like agency) in addition to utility because they are instrumentally useful (which agency is). But if agency had never given us utility, we would never have valued it.