What you give is far harder than a Newcomb-like problem. In Newcomb-like problems, Omega rewards your decisions; he isn’t looking at how you reach them. This leaves you free to optimize those decisions.
You misunderstand. In my variant, Omega is also not looking at how you reach your decision. Rather, he is looking at you beforehand—“scanning your brain”, if you will—and evaluating the kind of person you are (i.e., how you “would” behave). This, along with the choice you make, determines your later reward.
In the classical problem (unless you just assume backwards causation), what Omega is doing is assessing the kind of person you are before you’ve physically indicated your choice. You’re rewarded iff you’re the kind of person who would choose only box B.
My variant is exactly symmetrical: he assesses whether you are the kind of person who is rational, and responds as I outlined.
We have such an Omega: we just refer to it differently.
After all, we are used to treating our genes and our environments as definite influences on our ability to Win. Taller people tend to make more money; Omega says “there will be $1mil in box B if you have alleles for height.”
If Omega makes decisions based on properties of the agent, and not on the decisions either made or predicted to be made by the agent, then Omega is no different from, well, a lot of the world.
Given these observations, rationality might be better redefined as “making the decisions that Win whenever such decisions actually affect one’s probability of Winning,” though I prefer Eliezer’s more general rules plus the tacit understanding that we are only including situations where decisions make a difference.
Quoting myself:
I’ll go further and say this distinction doesn’t matter unless you assume that Newcomb’s problem is a time paradox or some other kind of backwards causation.
This is all tangential, though, I think.
Yes, all well and good (though I don’t see how you identify any distinction between “properties of the agent” and “decisions . . . predicted to be made by the agent”, or why you care about it). My point is that a concept of rationality-as-winning can’t have a definite extension across, say, the domain of agents, because of the existence of Russell’s-Paradox-style problems like the one I identified.
This point is perfectly robust to the observation that weird and seemingly arbitrary properties are rewarded by the game known as the universe. Your proposed redefinition may actually disagree with EY’s theory of Newcomb’s problem. After all, your decision can’t empty box B, since the contents of box B are already determined by the time you make your decision.
Hello. My name is Omega. Until recently I went around claiming to be all-knowing/psychic/whatever, but now I understand lying is Wrong, so I’m turning over a new leaf. I’d like to offer you a game.
Here are two boxes. Box A contains $1,000, box B contains $1,000,000. Both boxes are covered by a touch-sensitive layer. If you choose box B only (please signal that by touching box B), it will send out a radio signal to box A, which will promptly disintegrate. If you choose both boxes (please signal that by touching box A first), a radio signal will be sent out to box B, which will disintegrate its contents, so opening it will reveal an empty box.
(I got the disintegrating technology from the wreck of a UFO that crashed into my barn, but that’s not relevant here.)
I’m afraid that if I or my gadgets detect any attempt to tamper with the operation of my boxes, I will be forced to disqualify you.
In case there is doubt, this is the same game I used to offer back in my deceitful days. The difference is, now the player knows the rules are enforced by cold hard electronics, so there’s no temptation to try and outsmart anybody.
So, what will it be?
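The rules above reduce to a trivial payoff function. The sketch below is only an illustration of the game as stated, not anything from the original exchange, and the function name is made up:

```python
# A minimal sketch of the mechanical variant described above: the payout
# depends only on which box you touch first, so no prediction is involved.

def mechanical_omega(choice: str) -> int:
    """Payout for the transparent, electronics-enforced game."""
    if choice == "one-box":    # touch box B only; box A disintegrates
        return 1_000_000
    if choice == "two-box":    # touch box A first; box B's contents disintegrate
        return 1_000
    raise ValueError("choice must be 'one-box' or 'two-box'")

assert mechanical_omega("one-box") > mechanical_omega("two-box")
```

Written this way, the choice alone fixes the outcome, which is exactly why the replies below object that it is no longer the same game.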
Yes, you are changing the hypothetical. Your Omega dummy says that it is the same game as Newcomb’s problem, but it’s not. As VN notes, it may be equivalent to the version of Newcomb’s problem that assumes time travel, but that is not the classical (or an interesting) statement of the problem.
What is your point? You seem to be giving a metaphor for solving the problem by imagining that your action has the direct consequence of changing the past (and, as a result, the contents of the box in the present). More about that in this comment.
Naive argument coming up.
How Omega decides what to predict, or what makes its stated condition for box B (i.e., the result of its “prediction”) come true, is not relevant. Ignoring the data that says it’s always/almost always correct, however, seems … not right. Any decision must be made with the understanding that Omega has most likely predicted it. You can’t outsmart it with a last-second switch that its model of you somehow failed to anticipate. The moment you decide to two-box is the moment Omega predicted, when it chose to empty box B.
Consider this:
Andy: “Sure, one box seems like the good choice, because Omega would take the million away otherwise. OK. … Now that the boxes are in front of me, I’m thinking I should take both. Because, you know, two is better than one. And it’s already decided, so my choice won’t change anything. Both boxes.”
Barry: “Sure, one box seems like the good choice, because Omega would take the million away otherwise. OK. … Now that the boxes are in front of me, I’m thinking I should take both. Because, you know, two is better than one. Of course the outcome still depends on what Omega predicted. Say I choose both boxes. So if Omega’s prediction is correct this time, I will find an empty B. But maybe Omega was wrong THIS time. Sure, and maybe THIS time I will also win the lottery. How it would have known is not relevant. The fact that O already acted on its prediction doesn’t make it more likely to be wrong. Really, what is the dilemma here? One box.”
Ok, I don’t expect that I’m the first person to say all this. But then, I wouldn’t have expected anybody to two-box, either.
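Barry’s reasoning above amounts to a simple expected-value comparison. The sketch below is only an illustration: the accuracy figure p is an assumption standing in for “always/almost always correct”, not something given in the problem.

```python
# Rough expected values for Newcomb's problem, assuming Omega's prediction
# is correct with probability p (an assumed figure).

def ev_one_box(p: float) -> float:
    # If the prediction was correct, box B holds $1,000,000; otherwise it is empty.
    return p * 1_000_000

def ev_two_box(p: float) -> float:
    # Box A's $1,000 is always yours; box B pays out only if Omega was wrong.
    return 1_000 + (1 - p) * 1_000_000

for p in (0.99, 0.9, 0.51):
    print(p, ev_one_box(p), ev_two_box(p))
# One-boxing comes out ahead whenever p > 0.5005, i.e. whenever Omega is
# even slightly better than a coin flip.
```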
major said:
You’re not the only person to wonder this. Either I’m missing something, or two-boxers just fail at induction.
I have to wonder how two-boxers would do on the “Hot Stove Problem.”
In case you guys haven’t heard of such a major problem in philosophy, I will briefly explain the Hot Stove Problem:
You have touched a hot stove 100 times. 99 times you have been burned. Nothing has changed about the stove that you know about. Do you touch it again?
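To put a number on the induction here, a Laplace rule-of-succession estimate from the 99-out-of-100 record goes as follows (my own illustration, not part of the original exchange):

```python
# Plain induction on the hot-stove record via Laplace's rule of succession
# (my own choice of estimator).

burns, trials = 99, 100
p_burn_next = (burns + 1) / (trials + 2)   # (99 + 1) / (100 + 2) ≈ 0.98
print(f"P(burned on the next touch) ≈ {p_burn_next:.2f}")
# Any reasonable expected-utility calculation then says: don't touch it again.
```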
I can see the relation to Newcomb—this is also a weird counterfactual that will never happen. I haven’t deliberately touched a hot stove in my adult life, and don’t expect to. I certainly won’t get to 99 times.