Joe Collman comments on Example population ethics: ordered discounted utility

Joe Collman 12 Mar 2019 13:22 UTC
1 point
It’s interesting. A few points:
Is there a natural extension for infinite population? It seems harder than most approaches to adapt.
I’m always suspicious of schemes that change what they advocate massively based on events a long time ago in a galaxy far, far away—in particular when it can have catastrophic implications. If it turns out there were 3^^^3 Jedi living in a perfect state of bliss, this advocates for preventing any more births now and forever.
Do you know a similar failure case for total utilitarianism? All the sadistic/repugnant/very-repugnant… conclusions seem to be comparing highly undesirable states—not attractor states. If we’d never want world A or B, wouldn’t head towards B from A, and wouldn’t head towards A from B (since there’d always be some preferable direction), does an A-vs-B comparison actually matter at all?
Total utilitarianism is an imperfect match for our intuitions when comparing arbitrary pairs of worlds, but I can’t recall seeing any practical example where it’d lead to clearly bad decisions. (perhaps birth-vs-death considerations?)
In general, I’d be interested to know whether you think an objective measure of per-person utility even makes sense. People’s take on their own situation tends to adapt to their expectations (as you’d expect, from an evolutionary fitness point of view). A zero-utility life from our perspective would probably look positive 1000 years ago, and negative (hopefully) in 100 years. This is likely true even if the past/future people were told in detail how the present-day ‘zero’ life felt from the inside: they’d assume our evaluation was simply wrong.
Or if we only care about (an objective measure of) subjective experience, does that mean we’d want people who’re all supremely happy/fulfilled/… with their circumstances to the point of delusion?
Measuring personal utility can be seen as an orthogonal question, but if I’m aiming to match my intuitions I need to consider both. If I consider different fixed personal-utility-metrics, it’s quite possible I’d arrive at a different population ethics. [edited from “different population utilities”, which isn’t what I meant]
I think you’re working in the dark if you try to match population ethics to intuition without fixing some measure of personal utility (perhaps you have one in mind, but I’m pretty hazy myself :)).
- Stuart_Armstrong 13 Mar 2019 12:49 UTC
  2 points
  Parent
  
  Is there a natural extension for infinite population? It seems harder than most approaches to adapt.
  
  None of the population ethics have decent extensions to infinite populations. I have a very separate idea for infinite populations here. I suppose the extension of this method to infinite population would use the same method as in that post, but use $(γ s (w) + i (w)) / (1 + γ)$ instead of $(s (w) + i (w)) / 2$ (where $s (w)$ and $i (w)$ are the limsup and liminf of utilities, respectively).
  
  I’m always suspicious of schemes that change what they advocate massively based on events a long time ago in a galaxy far, far away—in particular when it can have catastrophic implications. If it turns out there were 3^^^3 Jedi living in a perfect state of bliss, this advocates for preventing any more births now and forever.
  
  You can always zero out those utilities by decree, and only consider utilities that you can change. There are other patches you can apply. By talking this way, I’m revealing the principle I’m most willing to sacrifice: elegance.
  
  Do you know a similar failure case for total utilitarianism? All the sadistic/repugnant/very-repugnant… conclusions seem to be comparing highly undesirable states—not attractor states. If we’d never want world A or B, wouldn’t head towards B from A, and wouldn’t head towards A from B (since there’d always be some preferable direction), does an A-vs-B comparison actually matter at all?
  
  If A is repugnant and C is now, you can get from C to A by doing improvements (by the standard of total utilitarianism) every step of the way. Similarly, if B is worse than A on that standard, there is a hypothetical path from B to A which is an “improvement” at each step (most population ethics have this property, but not all—you need some form of “continuity”).
  
  It’s possible that the most total-ut distribution of matter in the universe is a repugnant way; in that case, a sufficiently powerful AI may find a way to reach that.
  
  In general, I’d be interested to know whether you think an objective measure of per-person utility even makes sense.
  
  a) I don’t think it makes sense in any strongly principled way, b) I’m trying to build one anyway :-)
  - Ghatanathoah 11 Sep 2021 2:10 UTC
    1 point
    Parent
    You can always zero out those utilities by decree, and only consider utilities that you can change. There are other patches you can apply. By talking this way, I’m revealing the principle I’m most willing to sacrifice: elegance.
    It’s been a long time since you posted this, but if you see my comment, I’d be curious about what some others patches one could apply are. I have pretty severe scrupulosity issues around population ethics and often have trouble functioning because I can’t stop thinking about them. I dislike pure total utilitarianism, but I have trouble rejecting it precisely because of “galaxy far far away” type issues. I spend a lot of time worrying about the idea that I am forced to choose between two alternatives:
    1) That (to paraphrase what you said in your critique of total utilitarianism) it is a morally neutral act to kill someone if you replace them with someone whose lifetime utility is equal to the first person’s remaining lifetime utility (and on a larger scale, the Repugnant Conclusion), or
    2.That the human race might be obligated to go extinct if it turns out there is some utopia in some other branch of the multiverse, or the Andromeda Galaxy, or in some ancient, undiscovered fallen civilization in the past. Or that if the Earth was going to explode and I could press a button to save it, but it would result in future generations living slightly lower quality lives than present generations, I shouldn’t push the button.
    I’d really like to know some ways that I can reject both 1 and 2. I really admire your work on population ethics and find that your thinking on the subject is really closely aligned with my own, except that you’re better at it than me :)
    - Stuart_Armstrong 13 Sep 2021 11:52 UTC
      7 points
      Parent
      Hey there!
      
      I haven’t been working much on population ethics (I’m more wanting to automate the construction of values from human preferences so that an AI could extract a whole messy theory from it).
      
      My main thought on these issues is to set up a stronger divergence between killing someone and not bringing them into existence. For example, we could restrict preference-satisfaction to existing beings (and future existing beings). So if they don’t want to be killed, that counts as a negative if we do that, even if we replace them with someone happier.
      
      This has degenerate solutions too—it incentivises producing beings that are very easy to satisfy and that don’t mind being killed. But note that “create beings that score max on this utility scale, even if they aren’t conscious or human” is a failure mode for average and total utilitarianism as well, so this isn’t a new problem.
      - Ghatanathoah 15 Sep 2021 0:39 UTC
        1 point
        Parent
        So if they don’t want to be killed, that counts as a negative if we do that, even if we replace them with someone happier.
        
        I have that idea as my “line of retreat.” My issue with it is that it is hard to calibrate it so that it leaves as big a birth-death asymmetry as I want without degenerating into full-blown anti-natalism. There needs to be some way to say that the new happy person’s happiness can’t compensate for the original person’s death without saying that the original person’s own happiness can’t compensate for their own death, which is hard. If I calibrate it to avoid anti-natalism it becomes such a small negative that it seems like it could easily be overcome by adding more people with only a little more welfare.
        There’s also the two step “kill and replace” method, where in step one you add a new life barely worth living without affecting anyone. Since the new person exists now, they count the same as everyone else, so then in the second step you kill someone and transfer their resources to the new person. If this process gives the new person the same amount of utility as the old one, it seems neutral under total utilitarianism. I suppose under total preference utilitarianism its somewhat worse, since you now have two people dying with unsatisfied preferences instead of one, but it doesn’t seem like a big enough asymmetry for me.
        I feel like in order to reject the two step process, and to have as big an asymmetry as I want, I need to be able to reject “mere addition” and accept the Sadistic Conclusion. But that in turn leads to “galaxy far far away issues” where it becomes wrong to have children because of happy people in some far off place. Or “Egyptology” issues where its better for the world to end than for it to decline so future people have somewhat worse lives, and we are obligated to make sure the Ancient Egyptians didn’t have way better lives than ours before we decide on having children. I just don’t know. I want it to stop hurting my brain so badly, but I keep worrying about how there’s no solution that isn’t horrible or ridiculous.
        This has degenerate solutions too—it incentivises producing beings that are very easy to satisfy and that don’t mind being killed.
        For this one, I am just willing to just decree that creating creatures with a diverse variety of complex human-like psychologies is good, and creating creatures with weird minmaxing unambitious creatures is bad (or at least massively sub-optimal). To put it another way, Human Nature is morally valuable and needs to be protected.
        Another resource that helped me on this was Derek Parfit’s essay “What Makes Someone’s Life Go Best.” You might find it helpful, it parallels some of your own work on personal identity and preferences. The essay describes which of our preferences we feel count as part of our “self interest” and which do not. It helped me understand things, like why people general feel obligated to respect people’s “self interest” preferences (i.e. being happy, not dying), but not their “moral preferences” (i.e. making the country a theocracy, executing heretics).
        Parfit’s “Success Theory,” as he calls it, basically argues that only preferences that are “about your own life” count as “welfare” or “self interest.” So that means that we would not be making the world a better place by adding lives who prefer that the speed of light stay constant, or that electrons keep having negative charges. That doesn’t defuse the problem entirely, you could still imagine creating creatures with super unambitious life goals. But it gets it part of the way, the rest, again, I deal with by “defending Human Nature.”
        I’m more wanting to automate the construction of values from human preferences
        I had a question about that. It is probably a silly question since my understanding of decision and game theory is poor. When you were working on that you said that there was no independence of irrelevant alternatives. I’ve noticed that IIA is something that trips me up a lot when I think about population ethics. I want to be able to say something like “Adding more lives might be bad if there is still the option to improve existing ones instead, but might be good if the existing ones have already died and that option is foreclosed.” This violates IIA because I am conditioning whether adding more lives is good on whether there is another alternative or not.
        I was wondering if my brain might be doing the thing you described in your post on no IIA, where it is smashing two different values together and getting different results if there are more alternatives. It probably isn’t I am probably just being irrational, but reading that post just felt familiar.
  - Joe Collman 13 Mar 2019 16:00 UTC
    1 point
    Parent
    Thanks. I’ll check out the infinite idea.
    On repugnance, I think I’ve been thinking too much in terms of human minds only. In that case there really doesn’t seem to be a practical problem: certainly if C is now, continuous improvements might get us to a repugnant A—but my point is that that path wouldn’t be anywhere close to optimal. Total-ut prefers A to C, but there’d be a vast range of preferable options every step of the way—so it’d always end up steering towards some other X rather than anything like A.
    I think that’s true if we restrict to human minds (the resource costs of running a barely content one being a similar order of magnitude to those of running a happy one).
    But of course you’re right as soon as we’re talking about e.g. rats (or AI-designed molecular scale minds...). I can easily conceive of metrics valuing 50 happy rats over 1 happy human. I don’t think rat-world fits most people’s idea of utopia.
    I think that’s the style of repugnance that’d be a practical danger: vast amounts of happy-but-simple minds.
    - Stuart_Armstrong 14 Mar 2019 13:01 UTC
      2 points
      Parent
      
      I think that’s the style of repugnance that’d be a practical danger: vast amounts of happy-but-simple minds.
      
      Yep, that does seem a risk. I think that’s what the “muzak and potatoes” formulation of repugnance is about.