Wei Dai comments on Exterminating life is rational

Wei Dai 7 Aug 2009 1:38 UTC
4 points

I agree that there doesn’t seem to be a workable solution—my last refuge was just destroyed by Vladimir Nesov.

I’m afraid I don’t understand the difficulty here. Let’s assume that Omega can access any point in configuration space and make that the reality. Then either (A) at some point it runs out of things with which to entice you to draw another card, in which case your utility function is bounded or (B) it never runs out of such things, in which case your utility function in unbounded.

Why is this so paradoxical again?
- PhilGoetz 7 Aug 2009 4:00 UTC
  2 points
  Parent
  If it’s not paradoxical, how many cards would you draw?
  - Wei Dai 7 Aug 2009 9:09 UTC
    2 points
    Parent
    I guess no more than 10 cards. That’s based on not being able to imagine a scenario such that I’d prefer .999 probability of death + .001 probability of scenario to the status quo. But it’s just a guess because Omega might have better imagination that I do, or understand my utility function better than I do.
    - Eliezer Yudkowsky 7 Aug 2009 19:10 UTC
      11 points
      Parent
      Omega offers you the healing of all the rest of Reality; every other sentient being will be preserved at what would otherwise be death and allowed to live and grow forever, and all unbearable suffering not already in your causal past will be prevented. You alone will die.
      
      You wouldn’t take a trustworthy 0.001 probability of that reward and a 0.999 probability of death, over the status quo? I would go for it so fast that there’d be speed lines on my quarks.
      
      Really, this whole debate is just about people being told “X utilons” and interpreting utility as having diminishing marginal utility—I don’t see any reason to suppose there’s more to it than that.
      What links here?
      cousin_it's comment on Working Mantras by Eliezer Yudkowsky (25 Aug 2009 11:32 UTC; 3 points)
      - Wei Dai 7 Aug 2009 19:48 UTC
        3 points
        Parent
        
        You alone will die.
        
        There’s no reason for Omega to kill me in the winning outcome...
        
        You wouldn’t take a trustworthy 0.001 probability of that reward and a 0.999 probability of death, over the status quo?
        
        Well, I’m not as altruistic as you are. But there must be some positive X such that even you wouldn’t take a trustworthy X probability of that reward and a 1-X probability of death, over the status quo, right? Suppose you’ve drawn enough cards to win this prize, what new prize can Omega offer you to entice you to draw another card?
        Eliezer Yudkowsky 7 Aug 2009 23:51 UTC
        0 points
        Parent
        
        There’s no reason for Omega to kill me in the winning outcome...
        
        Omega’s a bastard. So what?
        
        Well, I’m not as altruistic as you are.
        
        WHAT? Are you honestly sure you’re THAT not as altruistic as I am?
        
        But there must be some positive X such that even you wouldn’t take a trustworthy X probability of that reward and a 1-X probability of death, over the status quo, right?
        
        There’s the problem of whether the scenario I described which involves a “forever” and “over all space” actually has infinite utility compared to increments in my own life which even if I would otherwise live forever would be over an infinitesimal fraction of all space, but if we fix that with a rather smaller prize that I would still accept, then yes of course.
        
        Suppose you’ve drawn enough cards to win this prize, what new prize can Omega offer you to entice you to draw another card?
        
        Heal this Reality plus another three?
        Wei Dai 8 Aug 2009 23:25 UTC
        5 points
        Parent
        
        Omega’s a bastard. So what?
        
        That’s fine, I just didn’t know if that detail had some implication that I was missing.
        
        WHAT? Are you honestly sure you’re THAT not as altruistic as I am?
        
        Yes, I’m pretty sure, although I leave open the possibility that I may encounter an argument in the future that would persuade me to change my mind. My understanding is that most people have preferences like mine, so I’m surprised that you’re so surprised.
        
        It seems that I had missed the earlier posts on bounded vs. unbounded utility functions. I’ll follow up there to avoid retreading old ground.
        What links here?
        Wei Dai's comment on The Preference Utilitarian’s Time Inconsistency Problem by Wei Dai (15 Jan 2010 9:17 UTC; 3 points)
        Eliezer Yudkowsky 9 Aug 2009 18:20 UTC
        11 points
        Parent
        
        Yes, I’m pretty sure, although I leave open the possibility that I may encounter an argument in the future that would persuade me to change my mind. My understanding is that most people have preferences like mine, so I’m surprised that you’re so surprised.
        
        I’m shocked, and I hadn’t thought that most people had preferences like yours—at least would not verbally express such preferences; their “real” preferences being a whole separate moral issue beyond that. I would have thought that it would be mainly psychopaths, the Rand-damaged, and a few unfortunate moral philosophers with mistaken metaethics, who would decline that offer.
        
        I guess I would follow up with these questions: (1) When you see someone else hurting, or attend a friend’s funeral, do you feel sad; (2) are you more viscerally afraid of your own death than the strength of that emotion, if comparing two single cases; (3) do you decline to multiply out of a deliberate belief that all events after your own death ought to have zero utility to you, even if they feel sad when you think about them now; or (4) do you just generally want to leave the intuitive judgment (2) with its innate lack of multiplication undisturbed?
        
        Or if I’m asking the wrong questions here, then what is going on? I would expect most humans to instinctively feel that their whole tribe, to say nothing of the entire rest of reality, was worth something; and I would expect a rationalist to understand that if their own life does not literally have lexicographic priority (i.e., lives of others have infinitesimal=0 value in the utility function) then the multiplication factor here is overwhelming; and I would also expect you, Wei Dai, to not mistakenly believe that you were rationally forced to be lexicographically selfish regardless of your feelings… so I’m really not clear on what could be going on here.
        
        I guess my most important question would be: Do you feel that way, or are you deciding that way? In the former case, I might just need to make a movie showing one individual after another being healed, and after you’d seen enough of them, you would agree—the visceral emotional force having become great enough. In the latter case I’m not sure what’s going on.
        
        PS again: Would you accept a 60% probability of death in exchange for healing the rest of reality?
        Wei Dai 9 Aug 2009 21:57 UTC
        8 points
        Parent
        
        I guess I would follow up with these questions: (1) When you see someone else hurting, or attend a friend’s funeral, do you feel sad; (2) are you more viscerally afraid of your own death than the strength of that emotion, if comparing two single cases; (3) do you decline to multiply out of a deliberate belief that all events after your own death ought to have zero utility to you, even if they feel sad when you think about them now; or (4) do you just generally want to leave the intuitive judgment (2) with its innate lack of multiplication undisturbed?
        
        1: Yes. 2: Yes. 3: No. 4: I see a number of reasons not to do straight multiplication:
        
        Straight multiplication leads to an absurd degree of unconcern for oneself, given that the number of potential people is astronomical. It means, for example, that you can’t watch a movie for enjoyment, unless that somehow increases your productivity for saving the world. (In the least convenient world, watching movies uses up time without increasing productivity.)
        No one has proposed a form of utilitarianism that is free from paradoxes (e.g., the Repugnant Conclusion).
        My current position resembles the “Proximity argument” from Revisiting torture vs. dust specks:
        
        Proximity argument: don’t ask me to value strangers equally to friends and relatives. If each additional person matters 1% less than the previous one, then even an infinite number of people getting dust specks in their eyes adds up to a finite and not especially large amount of suffering.
        
        This agrees with my intuitive judgment and also seems to have relatively few philosophical problems, compared to valuing everyone equally without any kind of discounting.
        
        I guess my most important question would be: Do you feel that way, or are you deciding that way?
        
        My last bullet above already answered this, but I’ll repeat for clarification: it’s both.
        
        PS again: Would you accept a 60% probability of death in exchange for healing the rest of reality?
        
        This should be clear from my answers above as well, but yes.
        cousin_it 12 Aug 2009 13:57 UTC
        7 points
        Parent
        Oh, ’ello. Glad to see somebody still remembers the proximity argument. But it’s adapted to our world where you generally cannot kill a million distant people to make one close relative happy. If we move to a world where Omegas regularly ask people difficult questions, a lot of people adopting proximity reasoning will cause a huge tragedy of the commons.
        
        About Eliezer’s question, I’d exchange my life for a reliable 0.001 chance of healing reality, because I can’t imagine living meaningfully after being offered such a wager and refusing it. Can’t imagine how I’d look other LW users in the eye, that’s for sure.
        Wei Dai 13 Aug 2009 9:23 UTC
        7 points
        Parent
        
        Can’t imagine how I’d look other LW users in the eye, that’s for sure.
        
        I publicly rejected the offer, and don’t feel like a pariah here. I wonder what is the actual degree of altruism among LW users. Should we set up a poll and gather some evidence?
        Vladimir_Nesov 12 Aug 2009 14:34 UTC
        3 points
        Parent
        Cooperation is a different consideration from preference. You can prefer only to keep your own “body” in certain dynamics, no matter what happens to the rest of the world, and still benefit the most from, roughly speaking, helping other agents. Which can include occasional self-sacrifice a la counterfactual mugging.
        conchis 12 Aug 2009 14:22 UTC
        3 points
        Parent
        
        No one has proposed a form of utilitarianism that is free from paradoxes (e.g., the Repugnant Conclusion).
        
        I’d be interested to know what you think of Critical-Level Utilitarianism and Population-Relative Betterness as ways of avoiding the repugnant conclusion and other problems.
    - PhilGoetz 7 Aug 2009 14:54 UTC
      0 points
      Parent
      So does your answer change once you’ve drawn 10 cards and are still alive?
      - Wei Dai 7 Aug 2009 18:16 UTC
        2 points
        Parent
        No, if my guess is correct, then some time before I’m offered the 11th card, Omega will say “I can’t double your utility again” or equivalently, “There is no prize I can offer you such that you’d prefer a .5 probability of it to keeping what you have.”
- Wei Dai 7 Aug 2009 21:52 UTC
  1 point
  Parent
  After further thought, I see that case (B) can be quite paradoxical. Consider Eliezer’s utility function, which is supposedly unbounded as a function of how many years he lives. In other words, Omega can increase Eliezer’s utility without bound just by giving him increasingly longer lives. Expected utility maximization then dictates that he keeps drawing cards one after another, even though he knows that by doing so, with probability 1 he won’t live to enjoy his rewards.
  - Vladimir_Nesov 7 Aug 2009 22:16 UTC
    4 points
    Parent
    When you go to infinity, you’d need to define additional mathematical structure that answers your question. You can’t just conclude that the correct course of action is to keep drawing cards for eternity, doing nothing else. Even if at each moment the right action is to draw one more card, when you consider the overall strategy, the strategy of drawing cards for all time may be a wrong strategy.
    
    For example, consider the following preference on infinite strings. A string has utility 0, unless it has the form 11111.....11112222...., that is a finite number of 1 followed by infinite number of 2, in which case its utility is the number of 1s. Clearly, a string of this form with one more 1 has higher utility than a string without, and so a string with one more 1 should be preferred. But a string consisting only of 1s doesn’t have the non-zero-utility form, because it doesn’t have the tail of infinite number of 2s. It’s a fallacy to follow an incremental argument to infinity. Instead, one must follow a one-step argument that considers the infinite objects as whole.
    What links here?
    Cyan's comment on Exterminating life is rational by PhilGoetz (7 Aug 2009 23:07 UTC; 0 points)
    - Nick_Tarleton 7 Aug 2009 23:46 UTC
      4 points
      Parent
      See also Arntzenius, Elga, and Hawthorne: “Bayesianism, Infinite Decisions, and Binding”.
    - Nick_Tarleton 7 Aug 2009 23:46 UTC
      0 points
      Parent
      See also Arntzenius, Elga, and Hall: “Bayesianism, Infinite Decisions, and Binding”.
    - Wei Dai 7 Aug 2009 22:35 UTC
      0 points
      Parent
      What you say sounds reasonable, but I’m not sure how I can apply it in this example. Can you elaborate?
      
      Consider Eliezer’s choice of strategies at the beginning of the game. He can either stop after drawing n cards for some integer n, or draw an infinite number of cards. First, (supposing it takes 10 seconds to draw a card)
      
      EU(draw an infinite number of cards) = ¹⁄₂ U(live 10 seconds) + ¹⁄₄ U(live 20 seconds) + ¹⁄₈ U(live 30 seconds) …
      
      which obviously converges to a small number. On the other hand, EU(stop after n+1 cards) > EU(stop after n cards) for all n. So what should he do?
      - Vladimir_Nesov 7 Aug 2009 23:46 UTC
        2 points
        Parent
        This exposes a hole in the problem statement: what does the Omega’s prize measure? We determined that U0 is the counterfactual where Omega kills you, U1 is the counterfactual where it does nothing, but what is U2=U1+3*(U1-U0)? This seems to be the expected utility of the event where you draw the lucky card, in which case this event contains, in particular, your future decisions to continue drawing cards. But if it’s so, it places a limit on how your utility can be improved further during the latter rounds, since if your utility continues to increase, it contradicts the statement in the first round that your utility is going to be only U2, and no more. Utility can’t change, as each utility is a valuation of a specific event in the sample space.
        
        So, the alternative formulation that removes this contradiction is for Omega to only assert that the expected utility given that you receive a lucky card is no less than U2. In this case the right strategy seems to be continue drawing cards indefinitely, since the utility you receive could be in something other than your own life, now spent drawing cards only.
        
        This however seems to sidestep the issue. What if the only utility you see is in the future actions you do, which don’t include picking cards, and you can’t interleave cards with other actions, that is you must allot a given amount of time to picking cards.
        
        You can recast the problem of choosing each of the infinite number of decisions (or one among all available in some sense infinite sequences of decisions) to the problem of choosing a finite “seed” strategy for making decisions. Say, only a finite number of strategies is available, for example only what fits in the memory of the computer that starts the enterprise, that could since the start of the experiment be expanded, but the first version has a specified limit. In this case, the right program is as close to Busy Beaver is you can get, that is you draw cards as long as possible, but only finitely long, and after that you stop and go on to enjoy the actual life.
        What links here?
        Vladimir_Nesov's comment on Ingredients of Timeless Decision Theory by Eliezer Yudkowsky (20 Aug 2009 0:36 UTC; 4 points)
    - DanArmak 7 Aug 2009 22:28 UTC
      0 points
      Parent
      Why are you treating time as infinite? Surely it’s finite, just taking unbounded values?
      
      Even if at each moment the right action is to draw one more card, when you consider the overall strategy, the strategy of drawing cards for all time may be a wrong strategy.
      
      But you’re not asked to decide a strategy for all of time. You can change your decision at every round freely.
      - Vladimir_Nesov 7 Aug 2009 23:26 UTC
        2 points
        Parent
        
        But you’re not asked to decide a strategy for all of time. You can change your decision at every round freely.
        
        You can’t change any fixed thing, you can only determine it. Change is a timeful concept. Change appears when you compare now and tomorrow, not when you compare the same thing with itself. You can’t change the past, and you can’t change the future. What you can change about the future is your plan for the future, or your knowledge: as the time goes on, your idea about a fact in the now becomes a different idea tomorrow.
        
        When you “change” your strategy, what you are really doing is changing your mind about what you’re planning. The question you are trying to answer is what to actually do, what decisions to implement at each point. A strategy for all time is a generator of decisions at each given moment, an algorithm that runs and outputs a stream of decisions. If you know something about each particular decision, you can make a general statement about the whole stream. If you know that each next decision is going to be “accept” as opposed to “decline”, you can prove that the resulting stream is equivalent to an infinite stream that only answers “accept”, at all steps. And at the end, you have a process, the consequences of your decision-making algorithm consist in all of the decisions. You can’t change that consequence, as the consequence is what actually happens, if you changed your mind about making a particular decision along the way, the effect of that change is already factored in in the resulting stream of actions.
        
        The consequentialist preference is going to compare the effect of the whole infinite stream of potential decisions, and until you know about the finiteness of the future, the state space is going to contain elements corresponding to the infinite decision traces. In this state space, there is an infinite stream corresponding to one deciding to continue picking cards for eternity.
        DanArmak 7 Aug 2009 23:48 UTC
        0 points
        Parent
        Thanks, I understand now.
        PhilGoetz 7 Aug 2009 23:34 UTC
        0 points
        Parent
        Whoa.
        
        Is there something I can take that would help me understand that better?
        Vladimir_Nesov 8 Aug 2009 0:03 UTC
        1 point
        Parent
        I’m more or less talking just about infinite streams, which is a well-known structure in math. You can try looking at the following references. Or find something else.
        
        P. Cousot & R. Cousot (1992). `Inductive definitions, semantics and abstract interpretations’. In POPL ’92: Proceedings of the 19th ACM SIGPLAN-SIGACT symposium on Principles of programming languages, pp. 83-94, New York, NY, USA. ACM. http://www.di.ens.fr/~cousot/COUSOTpapers/POPL92.shtml
        
        J. J. M. M. Rutten (2003). `Behavioural differential equations: a coinductive calculus of streams, automata, and power series’. Theor. Comput. Sci. 308(1-3):1-53. http://www.cwi.nl/~janr/papers/files-of-papers/tcs308.pdf
  - Alicorn 7 Aug 2009 21:55 UTC
    3 points
    Parent
    Does Omega’s utility doubling cover the contents of the as-yet-untouched deck? It seems to me that it’d be pretty spiffy re: my utility function for the deck to have a reduced chance of killing me.
    - randallsquared 9 Aug 2009 0:17 UTC
      3 points
      Parent
      At first I thought this was pretty funny, but even if you were joking, it may actually map to the extinction problem, since each new technology has a chance of making extinction less likely, as well. As an example, nuclear technology had some probability of killing everyone, but also some probability of making Orion ships possible, allowing diaspora.
    - Alicorn 11 Aug 2009 19:09 UTC
      −2 points
      Parent
      While I’m gaming the system, my lifetime utility function (if I have one) could probably be doubled by giving me a reasonable suite of superpowers, some of which would let me identify the rest of the cards in the deck (X-ray vision, precog powers, etc.) or be protected from whatever mechanism the skull cards use to kill me (immunity to electricity or just straight-up invulnerability). Is it a stipulation of the scenario that nothing Omega does to tweak the utility function upon drawing a star affects the risks of drawing from the deck, directly or indirectly?
      - orthonormal 11 Aug 2009 19:23 UTC
        3 points
        Parent
        It should be, especially since the existential-risk problems that we’re trying to model aren’t known to come with superpowers or other such escape hatches.
- Cyan 7 Aug 2009 2:14 UTC
  0 points
  Parent
  Yeesh. I’m changing my mind again tonight. My only excuse is that I’m sick, so I’m not thinking as straight as I might.
  
  I was originally thinking that Vladimir Nesov’s reformulation showed that I would always accept Omega’s wager. But now I see that at some point U1+3*(U1-U0) must exceed any upper bound (assuming I survive that long).
  
  Given U1 (utility of refusing initial wager), U0 (utility of death), U_max, and U_n (utility of refusing wager n assuming you survive that long), it might be possible that there is a sequence of wagers that (i) offer positive expected utility at each step; (ii) asymptotically approach the upper bound if you survive; and (iii) have a probability of survival approaching zero. I confess I’m in no state to cope with the math necessary to give such a sequence or disprove its existence.
  - pengvado 7 Aug 2009 3:23 UTC
    1 point
    Parent
    There is no such sequence. Proof:
    
    In order for wager n to be nonnegative expected utility, P(death)*U_0 + (1-P(death))*U_(n+1) >= U_n. Equivalently, P(death this time | survived until n) ⇐ (U_(n+1)-U_n) / (U_(n+1)-U0).
    
    Assume the worst case, equality. Then the cumulative probability of survival decreases by exactly the same factor as your utility (conditioned on survival) increases. This is simple multiplication, so it’s true of a sequence of borderline wagers too.
    
    With a bounded utility function, the worst sequence of wagers you’ll accept in total is P(death) ⇐ (U_max-U0)/(U1-U0). Which is exactly what you’d expect.
    - Cyan 7 Aug 2009 4:55 UTC
      0 points
      Parent
      When there’s an infinite number of wagers, there can be a distinction between accepting the whole sequence at one go and accepting each wager one after another. (There’s a paradox associated with this distinction, but I forget what it’s called.) Your second-last sentence seems to be a conclusion about accepting the whole sequence at one go, but I’m worried about accepting each wager one after another. Is the distinction important here?
      - PhilGoetz 7 Aug 2009 5:02 UTC
        0 points
        Parent
        
        there can be a distinction between accepting the whole sequence at one go and accepting each wager one after another.
        
        Are you thinking of the Riemann series theorem? That doesn’t apply when the payoff matrix for each bet is the same (and finite).
        Cyan 7 Aug 2009 23:07 UTC
        0 points
        Parent
        No, it was this thing. I just couldn’t articulate it.
        Douglas_Knight 7 Aug 2009 6:49 UTC
        0 points
        Parent
        A bounded utility function probably gets you out of all problems along those lines.
        
        Certainly it’s good in the particular case: your expected utility (in the appropriate sense) is an increasing function of bets you accept and increasing sequences don’t have convergence issues.
        PhilGoetz 7 Aug 2009 14:56 UTC
        0 points
        Parent
        How would you bound your utility function? Just pick some arbitrary converging function f, and set utility’ = f(utility)? That seems arbitrary. I suspect it might also make theorems about expectation maximization break down.
        Douglas_Knight 7 Aug 2009 15:26 UTC
        0 points
        Parent
        No, I’m not advocating changing utility functions. I’m just saying that if your utility function is bounded, you don’t have either of these problems with infinity. You don’t have the convergence problem nor the original problem of probability of the good outcome going to zero. Of course, you still have the result that you keep making bets till your utility is maxed out with very low probability, which bothers some people.
  - PhilGoetz 7 Aug 2009 4:01 UTC
    0 points
    Parent
    How would it help if this sequence existed?
    - Cyan 7 Aug 2009 4:40 UTC
      0 points
      Parent
      If the sequence exists, then the paradox* persists even in the face of bounded utility functions. (Or possibly it already persists, as Vladimir Nesov argued and you agreed, but my cold-virus-addled wits aren’t sharp enough to see it.)
      
      * The paradox is that each wager has positive expected utility, but accepting all wagers leads to death almost surely.
      - PhilGoetz 7 Aug 2009 4:59 UTC
        0 points
        Parent
        Ah. So you don’t want the sequence to exist.
        Cyan 7 Aug 2009 12:45 UTC
        0 points
        Parent
        In the sense that if it exists, then it’s a bullet I will bite.