Eliezer Yudkowsky comments on Rationality is Systematized Winning

Eliezer Yudkowsky Apr 3, 2009, 6:00 PM
4 points
Given that I one-box on Newcomb’s Problem and keep my word as Parfit’s Hitchhiker, it would seem that the rational course of action is to not steer your car even if it crashes (if for some reason winning that game of chicken is the most important thing in the universe).
What links here?
- orthonormal's comment on Rationalists lose when others choose by PhilGoetz (Jun 16, 2009, 8:02 PM; 23 points)
- James_Miller Apr 3, 2009, 7:12 PM
  4 points
  Parent
  You are playing chicken with your irrational twin. Both of you would rather survive than win. Your twin, however, doesn’t understand that it’s possible to die when playing chicken. In the game your twin both survives and wins whereas you survive but lose.
  - Aurini Apr 3, 2009, 8:06 PM
    1 point
    Parent
    Then you murder the twin prior to the game of chicken, and fake his suicide. Or you intimidate the twin, using your advanced rational skills to determine how exactly to best fill them with fear and doubt.
    
    But before murdering or risking an uncertain intimidation feint, there’s another question you need to ask yourself. How certain are you that the twin is irrational? The Cold War was (probably) a perceptual error; neither side realized that they were in a prisoners dilemma, they both assumed that the other side preferred “unbalanced armament” over “mutual armament” over “mutual disarmament;” in reality, the last two should have been switched.
    
    Worst case scenario? You die playing chicken, because the stakes were worth it. The Rational path isn’t always nice.
    
    (There are some ethical premises implicit in this argument, premises which I plan to argue are natural derivatives from Game Theory… but I’m still working on that article.)
  - rwallace Apr 3, 2009, 7:21 PM
    0 points
    Parent
    My answer to that one is that I don’t play chicken in the first place unless the stake is something I’m prepared to die for.
    - James_Miller Apr 3, 2009, 7:27 PM
      7 points
      Parent
      There are lots of chicken like games that don’t involve death. For example, your boss wants some task done and either you or a co-worker can do it. The worst outcome for both you and the co-worker is for the task to not get done. The best is for the other person to do the task.
      - rwallace Apr 3, 2009, 7:30 PM
        2 points
        Parent
        My answer still applies—I’m not going to make a song and dance about who does it, unless the other guy has been systematically not pulling his weight and it’s got to the point where that matters more to me than this task getting done.
- Jonathan_Graehl Apr 3, 2009, 9:24 PM
  3 points
  Parent
  For Newcomb’s Problem, is it fair to say that if you believe the given information, the crux is whether you believe it’s possible (for Omega) to have a 99%+ correct prediction of your decision based on the givens? Refusal to accept that seems to me the only justification for two-boxing. Perhaps that’s a sign that I’m less tied to a fixed set of “rationalist” procedures than a perfect rationalist would be, but I would feel like I were pretending to say otherwise.
  
  I also wonder if the many public affirmations I’ve heard of “I would one-box Newcomb’s Problem” are attempts at convincing Omega to believe us in the unlikely event of actually encountering the Problem. It does give a similar sort of thrill to “God will rapture me to heaven.”
- rwallace Apr 3, 2009, 6:39 PM
  1 point
  Parent
  +1 for “Rationalists win”. What is Parfit’s Hitchhiker? I couldn’t find an answer on Google.
  - grobstein Apr 3, 2009, 7:05 PM
    3 points
    Parent
    It’s a test case for rationality as pure self-interest (really it’s like an altruistic version of the game of Chicken).
    
    Suppose I’m purely selfish and stranded on a road at night. A motorist pulls over and offers to take me home for $100, which is a good deal for me. I only have money at home. I will be able to get home then IFF I can promise to pay $100 when I get home.
    
    But when I get home, the marginal benefit to paying $100 is zero (under assumption of pure selfishness). Therefore if I behave rationally at the margin when I get home, I cannot keep my promise.
    
    I am better off overall if I can commit in advance to keeping my promise. In other words, I am better off overall if I have a disposition which sometimes causes me to behave irrationally at the margin. Under the self-interest notion of rationality, then, it is rational, at the margin of choosing your disposition, to choose a disposition which is not rational under the self-interest notion of rationality. (This is what Parfit describes as an “indirectly self-defeating” result; note that being indirectly self-defeating is not a knockdown argument against a position.)
    - rwallace Apr 3, 2009, 7:19 PM
      2 points
      Parent
      Ah, thanks. I’m of the school of thought that says it is rational both to promise to pay the $100, and to have a policy of keeping promises.
      - SarahNibs Apr 3, 2009, 8:22 PM
        1 point
        Parent
        I think it is both right and expected-utility-maximizing to promise pay the $100, right to pay the $100, and not expected-utility-maximizing to pay the $100 under standard assumptions of you’ll never see the driver again or whatnot.
        thomblake Apr 3, 2009, 8:31 PM
        1 point
        Parent
        You’re assuming it does no damage to oneself to break one’s own promises. Virtue theorists would disagree.
        
        Breaking one’s promises damages one’s integrity—whether you consider that a trait of character or merely a valuable fact about yourself, you will lose something by breaking your promise even if you never see the fellow again.
        grobstein Apr 3, 2009, 8:39 PM
        1 point
        Parent
        Your argument is equivalent to, “But what if your utility function rates keeping promises higher than a million orgasms, what then?”
        
        The hypo is meant to be a very simple model, because simple models are useful. It includes two goods: getting home, and having $100. Any other speculative values that a real person might or might not have are distractions.
        rwallace Apr 3, 2009, 11:44 PM
        2 points
        Parent
        Simple models are fine as long as we don’t forget they are only approximations. Rationalists should win in the real world.
        thomblake Apr 3, 2009, 8:43 PM
        2 points
        Parent
        Except that you mention both persons and promises in the hypothetical example, so both things factor into the correct decision. If you said that it’s not a person making the decision, or that there’s no promising involved, then you could discount integrity.
      - grobstein Apr 3, 2009, 7:29 PM
        1 point
        Parent
        Yes, this seems unimpeachable. The missing piece is, rational at what margin? Once you are home, it is not rational at the margin to pay the $100 you promised.
        randallsquared Apr 3, 2009, 8:08 PM
        2 points
        Parent
        This assumes no one can ever find out you didn’t pay, as well. In general, though, it seems better to assume everything will eventually be found out by everyone. This seems like enough, by itself, to keep promises and avoid most lies.
        grobstein Apr 3, 2009, 8:09 PM
        1 point
        Parent
        Right. The question of course is, “better” for what purpose? Which model is better depends on what you’re trying to figure out.
    - Paul Crowley Apr 3, 2009, 8:02 PM
      1 point
      Parent
      Thank you, I too was curious.
      
      We need names for these positions; I’d use hyper-rationalist but I think that’s slightly different. Perhaps a consequentialist does whatever has the maximum expected utility at any given moment, and a meta-consequentialist is a machine built by a consequentialist which is expected to achieve the maximum overall utility at least in part through being trustworthy to keep commitments a pure consequentialist would not be able to keep.
      
      I guess I’m not sure why people are so interested in this class of problems. If you substitute Clippy for my lift, and up the stakes to a billion lives lost later in return for two billion saved now, there you have a problem, but when it’s human beings on a human scale there are good ordinary consequentialist reasons to honour such bargains, and those reasons are enough for the driver to trust my commitment. Does anyone really anticipate a version of this situation arising in which only a meta-consequentialist wins, and if so can you describe it?
      - grobstein Apr 3, 2009, 8:07 PM
        2 points
        Parent
        I do think these problems are mostly useful for purposes of understanding and (moreso) defining rationality (“rationality”), which is perhaps a somewhat dubious use. But look how much time we’re spending on it.
      - grobstein Apr 3, 2009, 8:05 PM
        2 points
        Parent
        I very much recommend Reasons and Persons, by the way. A friend stole my copy and I miss it all the time.
        Paul Crowley Apr 4, 2009, 8:38 AM
        5 points
        Parent
        OK, thanks!
        
        Your friend stole a book on moral philosophy? That’s pretty special!
        MichaelHoward Apr 5, 2009, 2:18 PM
        3 points
        Parent
        It seems ethics books are more likely to be stolen.
        gjm Apr 3, 2009, 11:35 PM
        2 points
        Parent
        It’s still in print and readily available. If you really miss it all the time, why haven’t you bought another copy?
        grobstein Apr 3, 2009, 11:37 PM
        1 point
        Parent
        It’s $45 from Amazon. At that price, I’m going to scheme to steal it back first.
        
        OR MAYBE IT’S BECAUSE I’M CRAAAZY AND DON’T ACT FOR REASONS!
        gjm Apr 4, 2009, 12:35 AM
        3 points
        Parent
        Gosh. It’s only £17 in the UK.
        
        (I wasn’t meaning to suggest that you’re crazy, but I did wonder about … hmm, not sure whether there’s a standard name for it. Being less prepared to spend X to get Y on account of having done so before and then lost Y. A sort of converse to the endowment effect.)
        Nick_Tarleton Apr 4, 2009, 6:51 AM
        2 points
        Parent
        Mental accounting has that effect in the short run, but seems unlikely to apply here.
- grobstein Apr 3, 2009, 6:14 PM
  1 point
  Parent
  Why don’t you accept his distinction between acting rationally at a given moment and having the disposition which it is rational to have, integrated over all time?
  
  EDIT: er, Parfit’s, that is.