You’re saying that we live in a universe where Newcomb’s problem is impossible because the future doesn’t affect the past. I’ll rephrase the problem in a way that seems plausible in our universe:
I’ve got really nice scanning software. I scan your brain down to the molecule, and make a virtual representation of it on a computer. I run virtual-you in my software, and give virtual-you Newcomb’s problem. Virtual-you answers, and I arrange my boxes according to that answer.
I come back to real-you. You’ve got no idea what’s going on. I explain the scenario to you and I give you Newcomb’s problem. How do you answer?
This particular instance of the problem does have an obvious, relatively uncomplicated solution: Lbh unir ab jnl bs xabjvat jurgure lbh ner rkcrevrapvat gur cneg bs gur fvzhyngvba, be gur cneg bs gur syrfu-naq-oybbq irefvba. Fvapr lbh xabj gung obgu jvyy npg vqragvpnyyl, bar-obkvat vf gur fhcrevbe bcgvba.
If for any reason you suspect that the Predictor can reach a sufficient level of accuracy to justify one-boxing, you one-box. It doesn’t matter what sort of universe you are in.
Not that I disagree with the one-boxing conclusion, but this formulation requires physically reducible free will (which has recently been brought back into discussion). It would also require knowing the position and momentum of a lot of particles to arbitrary precision, which is provably impossible.
We don’t need a perfect simulation for the purposes of this problem in the abstract—we just need a situation such that the problem-solver assigns better-than-chance predicting power to the Predictor, and a sufficiently high utility differential between winning and losing.
The “perfect whole brain simulation” is an extreme case which keeps things intuitively clear. I’d argue that any form of simulation which performs better than chance follows the same logic.
The only way to escape the conclusion via simulation is if you know something that Omega doesn’t—for example, you might have some secret external factor modify your “source code” and alter your decision after Omega has finished examining you. Beating Omega essentially means that you need to keep your brain-state in such a form that Omega can’t deduce that you’ll two-box.
As Psychohistorian3 pointed out, the power that you’ve assigned to Omega predicting accurately is built into the problem. Your estimate of the probability that you will succeed in deception via the aforementioned method or any other is fixed by the problem.
In the real world, you are free to assign whatever probability you want to your ability to deceive Omega’s predictive mechanisms, which is why this problem is counterintuitive.
Also: You can’t simultaneously claim that any rational being ought to two-box, this being the obvious and overdetermined answer, and also claim that it’s impossible for anyone to figure out that you’re going to two-box.
Right, any predictor with at least 50.05% accuracy is worth one-boxing on (well, perhaps a higher percentage for those with concave utility functions in money). A predictor accurate enough to justify one-boxing isn’t unrealistic or counterintuitive at all in itself, but it seems (to me at least) that many people reach the right answer for the wrong reason: the “you don’t know whether you’re real or a simulation” argument. Realistically, while backwards causality isn’t feasible, neither is precise mind duplication. The decision to one-box can be rationally reached without those reasons: you choose to be the kind of person to (predictably) one-box, and as a consequence of that, you actually do one-box.
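(For what it’s worth, here is where the 50.05% figure comes from, as a quick sketch assuming the standard payoffs of $1,000,000 in the opaque box and $1,000 in the transparent one; the payoffs are an assumption, not something specified in this thread.)

```python
# Break-even predictor accuracy for one-boxing, assuming the standard
# Newcomb payoffs: $1,000,000 in the opaque box, $1,000 in the transparent one.
BIG, SMALL = 1_000_000, 1_000

def ev_one_box(p):
    # The opaque box is filled iff the predictor was right, with probability p.
    return p * BIG

def ev_two_box(p):
    # The opaque box is filled iff the predictor was wrong, with probability 1 - p.
    return (1 - p) * BIG + SMALL

# One-boxing wins when p * BIG > (1 - p) * BIG + SMALL,
# i.e. p > (BIG + SMALL) / (2 * BIG).
print((BIG + SMALL) / (2 * BIG))            # 0.5005
print(ev_one_box(0.51) > ev_two_box(0.51))  # True: 510,000 vs 491,000
```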
Oh, that’s fair. I was thinking of “you don’t know whether you’re real or a simulation” as an intuitive way to prove the case for all “conscious” simulations. It doesn’t have to be perfect—you could just as easily be an inaccurate simulation, with no way to know that you are a simulation and no way to know that you are inaccurate with respect to an original.
I was trying to get people to generalize downwards from the extreme intuitive example: even with decreasing accuracy, as the simulation becomes so rough as to lose “consciousness” and “personhood”, the argument keeps holding.
Yeah, the argument would hold just as much with an inaccurate simulation as with an accurate one. The point I was trying to make wasn’t so much that the simulation isn’t going to be accurate enough, but that a simulation argument shouldn’t be a prerequisite to one-boxing. If the experiment were performed with human predictors (let’s say a psychologist who predicts correctly 75% of the time), one-boxing would still be rational despite knowing you’re not a simulation. I think LW relies on computationalism as a substitute for actually being reflectively consistent in problems such as these.
The trouble with real world examples is that we start introducing knowledge into the problem that we wouldn’t ideally have. The psychologist’s 75% success rate doesn’t necessarily apply to you—in the real world you can make a different estimate than the one that is given. If you’re an actor or a poker player, you’ll have a much different estimate of how things are going to work out.
Psychologists are just messier versions of brain scanners—the fundamental premise is that they are trying to access your source code.
And what’s more—suppose the predictions weren’t made by accessing your source code? The direction of causality does matter. If Omega can predict the future, the causal lines flow backwards from your choice to Omega’s past move. If Omega is scanning your brain, the causal lines go from your brain-state to Omega’s decision. If there are no causal lines between your brain/actions and Omega’s choice, you always two-box.
Real world example: what if I substituted your psychologist for a sociologist, who predicted you with above-chance accuracy using only your demographic factors? In this scenario, you ought to two-box—if you disagree, let me know and I can explain myself.
In the real world, you don’t know to what extent your psychologist is using sociology (or some other factor outside your control). People can’t always articulate why, but their intuition (correctly) begins to make them deviate from the given success-rate estimate as more of these real-world variables get introduced.
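(A minimal sketch of why the sociologist case comes out differently, with purely illustrative numbers; the fixed prediction probability q and the payoffs are assumptions.)

```python
# If the prediction is fixed by factors outside your control (demographics,
# past record), your choice can't move it, and two-boxing dominates.
# q is the fixed probability that the opaque box was filled.
BIG, SMALL = 1_000_000, 1_000

def ev(choice, q):
    expected_opaque = q * BIG
    return expected_opaque + (SMALL if choice == "two-box" else 0)

for q in (0.25, 0.5, 0.75):
    assert ev("two-box", q) == ev("one-box", q) + SMALL
# Whatever q is, two-boxing is worth exactly $1,000 more, because q no longer
# depends on what you decide.
```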
True, the 75% would merely be a past track record (and I am in fact a poker player). Indeed, if the prediction were based entirely or mostly on factors beyond my control (and I knew this), I would two-box. However, two-boxing is not necessarily optimal just because you do not know the mechanics of the predictor’s methods. In the limited predictor problem, the predictor doesn’t use simulations/scanners of any sort but instead uses logic, and yet one-boxers still win.
Agreed. To add on to this:
It’s worth pointing out that Newcomb’s problem always takes the form of Simpson’s paradox: the one-boxers beat the two-boxers as a whole, but among agents predicted to one-box, the two-boxers win, and among agents predicted to two-box, the two-boxers win.
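(To make the Simpson’s-paradox structure concrete, here is a toy version with a hypothetical 90%-accurate predictor and the standard payoffs; the numbers are chosen purely for illustration.)

```python
# Toy illustration of the Simpson's-paradox structure, assuming a 90%-accurate
# predictor and the standard $1,000,000 / $1,000 payoffs.
BIG, SMALL = 1_000_000, 1_000

def payoff(choice, predicted):
    opaque = BIG if predicted == "one-box" else 0
    return opaque + (SMALL if choice == "two-box" else 0)

# Out of every 10 agents of each type, 9 are predicted correctly.
avg_one = (9 * payoff("one-box", "one-box") + 1 * payoff("one-box", "two-box")) / 10
avg_two = (9 * payoff("two-box", "two-box") + 1 * payoff("two-box", "one-box")) / 10
print(avg_one, avg_two)  # 900000.0 vs 101000.0: one-boxers win overall

# But within each prediction group, two-boxers come out exactly $1,000 ahead.
for predicted in ("one-box", "two-box"):
    assert payoff("two-box", predicted) == payoff("one-box", predicted) + SMALL
```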
The only reason to one-box is when your actions (which include both the final decision and the thoughts leading up to it) affect Omega’s prediction. The general rule is: “Try to make Omega think you’re one-boxing, but two-box whenever possible.” It’s just that in Newcomb’s problem proper, fulfilling the first imperative requires actually one-boxing.
So you would never one-box unless the simulator did some sort of scan/simulation upon your brain? But it’s better to one-box and be derivable as the kind of person to (probably) one-box than to two-box and be derivable as the kind of person to (probably) two-box.
Your final decision never affects the actual arrangement of the boxes, but its causes do.
I’d one-box when Omega had sufficient access to my source-code. It doesn’t have to be through scanning—Omega might just be a great face-reading psychologist.
We’re in agreement. As we discussed, this only applies insofar as you can control the factors that lead you to be classified as a one-boxer or a two-boxer. You can alter neither demographic information nor past behavior. But when (and only when) one-boxing causes you to be derived as a one-boxer, you should obviously one-box.
Well, that’s true for this universe. I just assume we’re playing in any given universe, some of which include Omegas who can tell the future (which implies bidirectional causality), since Psychohistorian3 started out with that sort of thought when I first commented.
Ok, so we do agree that it can be rational to one-box when predicted by a human (if they predict based upon factors you control such as your facial cues). This may have been a misunderstanding between us then, because I thought you were defending the computationalist view that you should only one-box if you might be an alternate you used in the prediction.
Yes, we do agree on that.
Assuming that you have no information other than the base rate, and that it’s equally likely to be wrong either way.
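(That qualifier matters. If the predictor’s accuracy could differ depending on what you actually choose, the break-even condition generalizes; a rough sketch with hypothetical accuracies p1 and p2 and the standard payoffs, all assumptions for illustration.)

```python
# Generalized break-even condition when accuracy may depend on your choice.
# p1 = P(predictor is right | you one-box), p2 = P(predictor is right | you two-box).
BIG, SMALL = 1_000_000, 1_000

def one_boxing_wins(p1, p2):
    # Opaque box is filled with probability p1 if you one-box, 1 - p2 if you two-box.
    return p1 * BIG > (1 - p2) * BIG + SMALL

print(one_boxing_wins(0.5, 0.5))   # False: a coin-flip predictor isn't enough (500,000 vs 501,000)
print(one_boxing_wins(0.6, 0.45))  # True: 600,000 vs 551,000
```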
An alternate solution which results in even more winning is to cerqvpg gung V znl or va fhpu n fvghngvba va gur shgher. Unir n ubbqyhz cebzvfr gung vs V’z rire va n arjpbzoyvxr fvghngvba gung ur jvyy guerngra gb oernx zl yrtf vs V qba’g 2-obk. Cnl gur ubbqyhz $500 gb frpher uvf cebzvfr. Gura pbzcyrgryl sbetrg nobhg gur jubyr neenatrzrag naq orpbzr n bar-obkre. Fpnaavat fbsgjner jvyy cerqvpg gung V 1-obk, ohg VEY V’z tbvat gb 2-obk gb nibvq zl yrtf trggvat oebxra.
But you’ve perfectly forgotten about the hoodlum, so you will in fact one-box. Or does the hoodlum somehow show up and threaten you in the moment between the scanner filling the boxes and you making your decision? That seems to add an element of delay and environmental modification that I don’t think exists in the original problem, unless I’m misinterpreting.
Also, I feel like by analyzing your brain to some arbitrarily precise standard, the scanner could see three things: you are (or were at some point in the past) likely to think of this solution; you are/were likely to actually go through with it; and the hoodlum’s threat would, in fact, cause you to two-box, letting the scanner predict that you will two-box.