Correct me if I’m wrong, but pre-commitment isn’t an option in Newcomb’s problem, so the best, the most rational, the winning decision is two-boxing.
By construction, Omega’s predictions are known to be essentially infallible. Given that, whatever you choose, you can safely assume Omega will have correctly predicted that choice. To what extent, then, is pre-commitment distinguishable from deciding on the spot?
In a sense there is an implicit pre-commitment in the structure of the problem; while you have not pre-committed to a choice on this specific problem, you are essentially pre-committed to a decision-making algorithm.
Eliezer’s argument, if I understand it, is that any decision-making algorithm that results in two-boxing is by definition irrational due to giving a predictably bad outcome.
In a sense there is an implicit pre-commitment in the structure of the problem; while you have not pre-committed to a choice on this specific problem, you are essentially pre-committed to a decision-making algorithm.
That’s an interesting, and possibly fruitful, way of looking at the problem.
Pre-commitment is different from deciding on the spot because once you’re on the spot, there is nothing, absolutely nothing you can do to change what’s in box B. It’s over. It’s a done deal. It’s beyond your control.
My understanding of Eliezer’s argument is the same as yours. My objection is that two-boxing doesn’t actually give a bad outcome. It gives the best outcome you can get given the situation you’re in. That you don’t know what situation you’re in until after you’ve opened box B doesn’t change that fact. As Eliezer is so fond of saying, the map isn’t the territory.
Pre-commitment is different from deciding on the spot because once you’re on the spot, there is nothing, absolutely nothing you can do to change what’s in box B.
If your decision on the spot is 100 percent predictable ahead of time, as is explicitly assumed in the problem construction, you are effectively pre-committed as far as Omega is concerned. You, apparently, have essentially pre-committed to opening two boxes.
My objection is that two-boxing doesn’t actually give a bad outcome. It gives the best outcome you can get given the situation you’re in.
And yet, everyone who opens one box does better than the people who open two boxes.
You seem to have a very peculiar definition of “best outcome”.
If your decision on the spot is 100 percent predictable ahead of time, as is explicitly assumed in the problem construction, you are effectively pre-committed as far as Omega is concerned. You, apparently, have essentially pre-committed to opening two boxes.
What I meant by ‘pre-commitment’ is a decision that we can make if and only if we know about Newcomb-like problems before being faced with one. In other words, it’s a decision that can affect what Omega will put in box B. That Omega can deduce what my decision will be doesn’t mean that the decision has already been made.
And yet, everyone who opens one box does better than the people who open two boxes.
And every fatass who competes against an Olympic athlete in the scenario I described above does ‘better’ than the athlete. So what? Unless the athlete knows about the competition’s rules ahead of time and eats non-stop to turn himself into a fatass, there’s not a damn thing he can do about it, except try his best once the competition starts.
You seem to have a very peculiar definition of “best outcome”.
It seems too obvious to say, but I guess I have to say it. “The best outcome” in this context is “the best outcome that it is possible to achieve by making a decision”.
If box B contains nothing, then the best outcome that it is possible to achieve by making a decision is to win a thousand dollars. If box B contains a million dollars, then the best outcome that it is possible to achieve by making a decision is to win one million and one thousand dollars.
Well, I don’t see how I can explain myself more clearly than this, so this will be my last comment on this subject. In this thread. This week. ;)
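For readers keeping score, the payoff structure both sides are arguing over can be laid out explicitly. The sketch below (in Python, purely illustrative, using the standard $1,000 / $1,000,000 amounts that appear in this thread) is not anyone’s decision theory, just the scoreboard:

    # Payoff as a function of Omega's prediction and your actual choice
    # (standard amounts from this thread: $1,000 in box A; $1,000,000 or $0 in box B).
    def payoff(predicted_one_box, actually_one_box):
        box_b = 1_000_000 if predicted_one_box else 0
        box_a = 1_000
        return box_b if actually_one_box else box_b + box_a

    payoff(True, True)    # 1,000,000 -- predicted one-boxer who one-boxes
    payoff(True, False)   # 1,001,000 -- predicted one-boxer who two-boxes
    payoff(False, True)   # 0         -- predicted two-boxer who one-boxes
    payoff(False, False)  # 1,000     -- predicted two-boxer who two-boxes

The two-boxer’s point is that, holding the prediction fixed, two-boxing always pays $1,000 more; the one-boxer’s point is that Omega’s accuracy makes the mismatched cases nearly unreachable, so the comparison that actually gets realized is $1,000,000 versus $1,000.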
This exchange has finally given me a better understanding of this problem.
If you pre-commit now to always one-box – and you believe that about yourself – then deciding to one-box when Omega asks you is the best decision.
If you are uncertain of your commitment then you probably haven’t really pre-committed! I haven’t tried to math it, but I think your actual decision when Omega arrives would depend on the strength of your belief about your own pre-commitment. [Though a more-inconvenient possible world is the one in which you’ve never heard of this or any similar puzzle!]
Now I grok why rationality should be self-consistent under reflection.
Small nitpick: If you’ve really pre-committed to one-boxing, there is no decision to be made once Omega has set up the boxes. In fact, the thought of making a decision won’t even cross your mind. If it does cross your mind, you should two-box. But if you two-box, you now know that you haven’t really pre-committed to one-boxing. Actually, even if you decide to (mistakenly) one-box, you’ll still know you haven’t really pre-committed, or you wouldn’t have had to decide anything on the spot.
In other words, Newcomb’s problem can only ever involve a single true decision. If you’re capable of pre-commitment (that is, if you know about Newcomb-like problems in advance and if you have the means to really pre-commit), it’s the decision to pre-commit, which precludes any later decision. If you aren’t capable of pre-commitment (that is, if at least one of the above conditions is false), it’s the on-the-spot decision.
Eliezer’s argument, if I understand it, is that any decision-making algorithm that results in two-boxing is by definition irrational due to giving a predictably bad outcome.
So he’s assuming the conclusion that you get a bad outcome? Golly.
True, we don’t know the outcome. But we should still predict that it will be bad, due to Omega’s 99% accuracy rate.
Don’t mess with Omega.
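A back-of-the-envelope expected-value check of that prediction, assuming (an assumption not spelled out above) that the 99% accuracy applies symmetrically to either choice, and using the same payoffs as the rest of the thread:

    # Expected payoffs when Omega's prediction matches the actual choice 99% of the time.
    acc = 0.99
    ev_one_box = acc * 1_000_000 + (1 - acc) * 0                 # ~990,000
    ev_two_box = acc * 1_000 + (1 - acc) * (1_000_000 + 1_000)   # ~11,000

On this toy model, one-boxing comes out ahead for any accuracy above roughly 50.05%.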
The result of two-boxing is a thousand dollars. The result of one-boxing is a million dollars. By definition, a mind that always one-boxes receives a better payout than one that always two-boxes, and therefore one-boxing is more rational, by definition.
See Arguing “By Definition”. It’s particularly problematic when the definition of “rational” is precisely what’s in dispute.
The result of two-boxing is a thousand dollars more than you would have gotten otherwise. The result of one-boxing is a thousand dollars less than you would have gotten otherwise. Therefore two-boxing is more rational, by definition.
What determines whether you’ll be in a 1M/1M+1K situation or in a 0/1K situation is the kind of mind you have, but in Newcomb’s problem you’re not given the opportunity to affect what kind of mind you have (by pre-committing to one-boxing, for example); you can only decide whether to get X or X+1K, regardless of X’s value.
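One way to make the crux concrete: in the ‘X or X+1K’ framing, X is treated as fixed independently of your choice, but in the problem as stated Omega computes X from the very decision procedure that produces the choice. A toy sketch of that dependence (illustrative only, not a claim about which answer is right):

    # Toy model: Omega fills box B by consulting your decision procedure ahead of time.
    def omega_fills_box_b(decision_rule):
        return 1_000_000 if decision_rule() == "one-box" else 0

    def play(decision_rule):
        x = omega_fills_box_b(decision_rule)   # X is already a function of the rule itself
        if decision_rule() == "one-box":
            return x
        return x + 1_000

    play(lambda: "one-box")   # 1,000,000
    play(lambda: "two-box")   # 1,000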
Suppose for a moment that one-boxing is the Foo thing to do. Two-boxing is the expected-utility-maximizing thing to do. Omega decided to try to reward those minds which it predicts will choose to do the Foo thing with a decision between doing the Foo thing and gaining $1,000,000, and doing the unFoo thing and gaining $1,001,000, while giving those minds which it predicts will choose to do the unFoo thing a decision between doing the Foo thing and gaining $0 and doing the unFoo thing and gaining $1,000.
The relevant question is whether there is a generalization of the computation Foo which we can implement that doesn’t screw us over on all sorts of non-Newcomb problems. Drescher for instance claims that acting ethically implies, among other things, doing the Foo thing, even when it is obviously not the expected-utility-maximizing thing.