The categorisation possibility is reasonable. Personally I would estimate the probability of 99% accuracy achieved through categorisation lower than the probability of 99% accuracy achieved through mental simulation, but it’s certainly a competitive hypothesis.
Assuming she tells you that she predicted your actions through some unspecified mechanism other than imagining your thought process in sufficient detail for the imagined version to ask itself whether it just exists in her imagination, what do you do?
I question what reason I have to assume she’s being honest, and is in fact correct.
Given her psychological genius she is likely correct about the methods she used, although not certainly (she may not be good at self-analysis).
If I conclude that either A) she is being honest or B) the whole pay-off is a lie, then I will probably act on the second most plausible (to my mind) scenario.
I’ve yet to work out what that is.
Repeating the experiment often enough to get statistics that are precise enough for 99% accuracy would be extremely costly with the standard pay-out scheme, so while I jumped towards that as my secondary scenario, it’s actually very implausible.
Reduce both payoffs by a factor of 100.
The psychologist is hooked up to a revolutionary lie detector that is 99% reliable; there is a standing prize of $1,000,000,000 for anyone who can, after calibration, deceive it on more than 10 out of 50 statements (with no further calibration during the trial). The psychologist is known to have tried the test three times and failed (with 1, 4, and 3 successful deceptions).
Well, the psychologist’s track record of successful lying is within a plausible range of the 99% reliability.
With the payoffs decreased by a factor of 100, and the lie detector added in, my best guess would be that she’s repeated the experiment often, and gathered up a statistical model of people to which she can compare me, and to which I will be added.
In such a circumstance I think I would still tend to one-box, but the reason is slightly different.
I value the wellbeing of people who are like me. If I one-box, others like me will be more likely to receive the $10,000 rather than just the $10.
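For what it’s worth, here is a minimal sketch of the numbers, assuming the reduced payoffs above and the standard Newcomb payoff structure (the structure itself is my assumption, not something spelled out in the scenario):

    # Expected value at the stated 99% accuracy with the reduced payoffs.
    # Assumed payoff structure: the opaque box holds $10,000 iff one-boxing
    # was predicted; the transparent box always holds $10.
    ACCURACY = 0.99          # stated reliability of the prediction
    BIG, SMALL = 10_000, 10  # reduced payoffs

    def expected_value(one_box: bool) -> float:
        """Expectation conditional on the predictor being right with prob ACCURACY."""
        if one_box:
            # Predicted correctly -> opaque box is full; mispredicted -> empty.
            return ACCURACY * BIG + (1 - ACCURACY) * 0
        # Predicted correctly -> opaque box is empty; mispredicted -> full.
        return ACCURACY * SMALL + (1 - ACCURACY) * (BIG + SMALL)

    print("one-box:", expected_value(True))    # 9900.0
    print("two-box:", expected_value(False))   # roughly 110

Of course that is the expectation conditional on the prediction, which is exactly the kind of reasoning a strict CDT agent is supposed to refuse; the point is only that the numbers in the modified scenario still favour one-boxing.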
Are you sure you are actually trying to make a valid defense of CDT and not just looking for excuses?
What would you do if that somehow were not a consideration? (What would you do if you were more selfish? What would an otherwise identical but more selfish simulation of you do? What would you do if you could be reasonably sure that you won’t affect the payoff for anyone else you care about, for some reason that doesn’t change your estimate of the prediction’s accuracy or the way it came about [e.g. you are the last subject, and everyone before you for whom it would matter was asked what they would have done if they had been the last subject]?)
Are you sure you’re not just trying to destroy CDT rather than think rationally?
If you think I am being irrationally defensive of CDT, check the OTHER thread off my first reply.
You seem to be trying very hard indeed to tear down CDT.
CDT gives the correct result in the original posted scenario, for reasons which are not immediately obvious but are nonetheless present.
You appear to have accepted that, what with your gradually moving further and further from the original scenario.
In your scenario, designed specifically to make CDT not work, it would still work for me, because of who I am.
If I were more selfish, I don’t see CDT working in your scenario. If there is a reason why it should work, I haven’t realised it. But then, it’s a scenario contrived specifically so that CDT doesn’t work.
Your “everyone was the last subject” scenario breaks down somewhat; if everyone is told they are the last subject then I can’t take being told that I’m the last subject seriously.
If I AM the last subject, I will be extremely skeptical, given the sample size I expect to be needed for the 99% accuracy, and thus I will tend to behave as though I am not the last subject, simply because I don’t believe that I am.
My original point was simply that the starting post, while claiming to show problems with CDT, failed. It used a scenario that didn’t illustrate any problem with CDT.
Do you still disagree with my original point?
EDIT: You seem to think that I’m doing my best to defend CDT. I’m really not; I have no major vested interest in defending CDT except when it is unfairly attacked. Adambell has posted two scenarios where CDT works fine, with claims that CDT doesn’t work in those scenarios.
Almost everyone agrees that CDT two-boxes in the original scenario, both proponents and opponents of CDT. The only ways to make CDT “work” are excuses that are completely irrelevant to the original point of the scenario and amount to deliberately understanding the scenario differently than intended. This discussion thread has shown that the existence of such excuses is not implied by the structure of the problem, so any issues with a particular formulation are irrelevant. It’s sort of like arguing that EDT is right in the smoking lesion problem because any evidence that smoking and cancer are both caused by lesions, rather than cancer being caused by smoking, would be dubious, and avoiding smoking just to be sure would be prudent.
So because I disagree with your consensus, my rational objection must be wrong?
I didn’t change the scenario. I looked at the scenario, and asked what someone applying CDT rationally, who understood that it’s impossible to tell whether you’re being simulated or not, would do.
And, as it happened, I got the answer “they would one-box, because they’re probably a simulation”.
If I posted a scenario where an EDT person would choose to walk through a minefield, because they’ve never seen anyone walk through a minefield and thus don’t consider walking through a minefield to be evidence that they won’t live much longer, would you not think my scenario-crafting skills were a bit weak?
So because I disagree with your consensus, my rational objection must be wrong?
Not wrong, beside the point. Objections like that don’t touch the core of the problem at all. Finding clever ways for decision theory differences in example cases not to matter doesn’t change the validity of the decision theories.
Your minefield example is different in that the original formulation of Newcomb’s problem gets the point across for almost everyone, while I’m not sure what the point of the minefield example would be. That EDT would be even stupider than it already is if it restricted what kinds of evidence could be considered? Well, yes, of course. I won’t defend EDT; it’s wronger than CDT (though at least a bit better defined).
Not wrong, beside the point. Objections like that don’t touch the core of the problem at all. Finding clever ways for decision theory differences in example cases not to matter doesn’t change the validity of the decision theories.
CDT is seemingly imperfect. I have acknowledged such.
But pointing to CDT as failing when it doesn’t fail doesn’t help. Pointing to where it DOES fail helps.
When I see someone getting the right answer for the wrong reason I criticise their reasoning.
The point you should take away from Newcomb’s paradox isn’t that CDT fails (in some formulations it seems to; in others it’s just hard to apply); it’s that CDT is really hard to apply, so using something that gets the right answer easily is better.
Newcomb’s problem tries to show that CDT caring only about things caused by your decision afterwards can be a weakness, by providing an example where things caused by accurate predictions of your decision outweigh those things. Everything else is just window dressing. You are using the window dressing to explain how you care about these other things caused by the decision, so you coincidentally act just as if you also cared about the causes of accurate predictions of your decisions. But as long as you make out that the things caused by the decision which, according to the intention of the problem statement, should lead to the less desirable outcome actually lead to the more desirable outcome, you are not addressing Newcomb’s problem. You are just showing that what is a particular formulation of Newcomb’s problem for most people isn’t a formulation of Newcomb’s problem for you, in a way that doesn’t generalize.
The “accurate prediction” is a central part of Newcomb’s problem. The issues of whether it’s possible (I feel it is), and IN WHAT WAYS it is possible, are central to the validity of Newcomb’s problem.
If all possible ways of making the accurate prediction were to leave CDT working, then Newcomb’s problem wouldn’t be a problem for CDT (apart from the practical one of it being hard to apply correctly).
At present, it seems like there are possible ways that make CDT work, and possible ways that make CDT not work. If it were to someday be proved that all possible ways make CDT work, that would be a major proof. If it were to be proved (beyond all doubt) that a possible way was completely incompatible with CDT, that could also be important for AI creation.
I will admit that, given the tendency of CDT users to fail in this scenario, my objection isn’t that strong: CDT tends to lead people to the wrong answer here, so it’s not useful to them.
I suggest that the way you use ‘CDT’ is actually a hop and a jump in the direction of TDT. When you already have a box containing $1,000,000 in your hand, you are looking at $10,000 sitting on the table and deciding not to take it, even though you know that nothing you do now has any way of causing the money you already have to disappear. Pure CDT agents just don’t do that.
If you don’t know whether you’re a simulation or not, you don’t know whether or not your taking the second box will cause the real-world money not to be there.
And, as a simulation, you probably won’t get to spend any of that sim-world money you’ve got there.
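To spell that out with a rough sketch (all modelling choices here are mine rather than anything stated above): suppose the predictor simulates you, the simulated copy and the real copy run the same decision procedure, and what you care about is the money the real-world copy ends up with.

    # A rough sketch of the "I might be the simulation" reasoning; assumptions
    # are mine, not stated in the thread: the predictor simulates you, the
    # simulated copy and the real copy choose alike, and only the real-world
    # copy's payout matters. Standard-scale amounts are assumed.
    BIG, SMALL = 1_000_000, 1_000

    def real_world_payout(one_box: bool) -> int:
        """Payout to the real copy, given that both copies make the same choice."""
        # The simulated copy's choice is what causally fixes the opaque box.
        box_full = one_box
        if one_box:
            return BIG if box_full else 0
        return (BIG + SMALL) if box_full else SMALL

    print("one-box:", real_world_payout(True))    # 1000000
    print("two-box:", real_world_payout(False))   # 1000

In this sketch the probability of actually being the simulation drops out: as long as both copies choose alike, the real-world payout depends only on the choice made.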
To be fair, I don’t particularly use CDT consciously, because it seems to be flawed somehow (or at least, harder to use than intuition, and I’m lazy). But I came across Newcomb’s paradox, thought about it, and realised that in the traditional formulation I’m probably a simulation.
I don’t see why realising I’m probably a simulation is something a CDT agent can’t do?
If you don’t know whether you’re a simulation or not, you don’t know whether or not your taking the second box will cause the real-world money not to be there. And, as a simulation, you probably won’t get to spend any of that sim-world money you’ve got there.
Replace ‘Omega’ with Patrick Jane. No sims. What do you do?
A) I one-box. I will one-box in most reasonable scenarios.
B) How do you predict other people’s actions?
Personally, I mentally simulate them. Not particularly well, mind, but I do mentally simulate them.
Am I unusual in this?
I’ve never watched The Mentalist, but if Patrick Jane is sufficiently good to get a 99% success rate, I’m guessing his simulations are pretty damn good.
Patrick Jane is a fictional character in the TV show The Mentalist. He’s a former (fake) psychic who now uses his cold reading skills to fight crime.
Cheers, I had been looking that up; oddly, my edit to my post didn’t seem to update it.