In Newcomb the outcome “pick two boxes, get $1.001M” is not in the outcome space, unless you fight the hypothetical, so the properly restricted CDT one-boxes. In the payoff matrix [1000, 0; 1001000, 1000000] (rows: predicted two-boxing, predicted one-boxing; columns: take both boxes, take only box B) the off-diagonal cases are inconsistent with the statement that Omega is a perfect predictor, so if you take them into account, you are not solving Newcomb, but some other problem where Omega is imperfect with unknown probability. Once the off-diagonal outcomes are removed, CDT trivially agrees with EDT.
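A minimal sketch of that restricted comparison, assuming the perfect-predictor constraint leaves only the two diagonal outcomes:

```python
# With a perfect predictor, only the diagonal outcomes remain:
#   take both boxes -> predicted two-boxing -> $1,000
#   take only box B -> predicted one-boxing -> $1,000,000
restricted_payoffs = {"two-box": 1_000, "one-box": 1_000_000}

best = max(restricted_payoffs, key=restricted_payoffs.get)
print(best)  # one-box
```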
First, removal of those scenarios is inconsistent with CDT as it is normally interpreted: CDT evaluates the utility of an act by the expected outcome of that act being set exogenously, without dependence on past causes, i.e. what would happen if a force from some unanticipated outside context came in and forced you to one-box or two-box, regardless of what you would otherwise have done.
It doesn’t matter if the counterfactual computed in this way is unphysical, at least not without changing the theory.
Second, to avoid wrangling over this, many presentations add a small or epsilon error rate (e.g. the Predictor flips a weighted coin to determine whether to predict accurately or inaccurately, and is accurate 99% of the time, or 99.999999% of the time). What’s your take given that adjustment?
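For concreteness, here is how the conditional expected payoffs come out under an assumed 99% accuracy (the 99.999999% case is analogous, only closer to the perfect-predictor values):

```python
accuracy = 0.99  # assumed weighted coin: the Predictor is right this often

# Expected payoff conditional on each choice:
#   one-box: correct prediction -> box B full ($1,000,000); wrong -> box B empty ($0)
#   two-box: correct prediction -> box B empty ($1,000);    wrong -> box B full ($1,001,000)
ev_one_box = accuracy * 1_000_000 + (1 - accuracy) * 0      # 990,000
ev_two_box = accuracy * 1_000 + (1 - accuracy) * 1_001_000  # 11,000
print(ev_one_box, ev_two_box)
```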
Are you saying that the “CDT as it is normally interpreted” cannot help but fight the hypothetical? Then the Newcomb problem with a perfect predictor is not one where such CDT can be applied at all; it’s simply not in the CDT domain. Or you can interpret CDT as dealing with the possible outcomes only, and happily use it to one-box.
In the second case, first, extrapolating from an imperfect to a perfect predictor assumes the existence of the limit, which is a non-trivial mathematical assumption of continuity and is not guaranteed to hold in general (for example, a circle, no matter how large, is never topologically equivalent to a line).
That notwithstanding, CDT does take probabilities into account, at least the CDT as described in Wikipedia. So the question is: what is the counterfactual probability that, were I to two-box, I would get $1.001M, as opposed to the conditional probability of the same thing? The latter is very low; the former has to be evaluated on some grounds.
The standard two-boxer reasoning is that
if the prediction is for both A and B to be taken, then the player’s decision becomes a matter of choosing between $1,000 (by taking A and B) and $0 (by taking just B), in which case taking both boxes is obviously preferable. But, even if the prediction is for the player to take only B, then taking both boxes yields $1,001,000, and taking only B yields only $1,000,000—taking both boxes is still better, regardless of which prediction has been made.
Unpacking this logic, I conclude that “even if the prediction is for the player to take only B, then taking both boxes yields $1,001,000, and taking only B yields only $1,000,000—taking both boxes is still better” means assigning equal counterfactual probability to both outcomes, which goes against the problem setup, as it discards the available information (“it does not matter what Omega did, the past is past, let’s pick the dominant strategy”). This also highlights the discontinuity preventing one from taking this “information-discarding CDT” limit. This is similar to the information-discarding EDT deciding not to smoke in the smoking lesion problem.
Are you saying that the “CDT as it is normally interpreted” cannot help but fight the hypothetical?
It doesn’t have to fight the hypothetical. CDT counterfactuals don’t have to be possible.
The standard CDT algorithm computes the value of each action by computing the expected utility conditional on a miraculous intervention changing one’s decision to that action, separately from its earlier deterministic causes, and computing the causal consequences of that. See Anna’s discussion here, including modifications in which the miraculous intervention changes other things, like one’s earlier dispositions (perhaps before the Predictor scanned you) or the output of one’s algorithm (instantiated both in you and in the Predictor’s model).
Say, before the contents of the boxes are revealed, our CDTer assigns some probability p to the state of the world where box B is full and her internal makeup will deterministically lead her to one-box, and probability (1-p) to the state of the world where box B is empty and her internal makeup will deterministically lead her to two-box.
That notwithstanding, CDT does take probabilities into account, at least the CDT as described in Wikipedia. So the question is: what is the counterfactual probability that, were I to two-box, I would get $1.001M, as opposed to the conditional probability of the same thing? The latter is very low; the former has to be evaluated on some grounds.
Altering your action miraculously and exogenously would not change the box contents causally. So the CDTer uses the old probabilities for the box contents: the utility of one-boxing is computed to be $1,000,000 times p, and the utility of two-boxing is calculated to be $1,001,000 times p + $1,000 times (1-p).
If she is confident, based on past experience or introspection, that she will apply CDT, she will have previously updated to thinking that p is very low.
utility of one-boxing is computed to be $1,000,000 times p,
utility of two-boxing is calculated to be $1,001,000 times p + $1,000 times (1-p).
If she is confident, based on past experience or introspection, that she will apply CDT, she will have previously updated to thinking that p is very low.
Right, I forgot. The reasoning is “I’m a two-boxer because I follow a loser’s logic and Omega knows it, so I may as well two-box.” There is no anticipation of winning $1,001,000. No, that does not sound quite right...
The last bit about p going low with introspection isn’t necessary. The conclusion (two-boxing preferred, or at best indifference between one-boxing and two-boxing if one is certain one will two-box) follows under CDT with the usual counterfactuals for any value of p.
The reasoning is “well, if the world is such that I am going to two-box, then I should two-box, and if the world is such that I am going to one-box, then I should two-box.” Optional extension: “hmm, sounds like I’ll be two-boxing then, alas! No million dollars for me...” (Unless I wind up changing my mind or the like, which keeps p above 0.)
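A minimal sketch of that calculation, checking the CDT ranking for several values of p (payoffs as given above):

```python
# CDT holds the credence p in "box B is full" fixed under the intervention:
#   EU(one-box) = $1,000,000 times p
#   EU(two-box) = $1,001,000 times p + $1,000 times (1-p)
for p in (0.0, 0.01, 0.5, 0.99, 1.0):
    eu_one = 1_000_000 * p
    eu_two = 1_001_000 * p + 1_000 * (1 - p)
    assert eu_two > eu_one  # two-boxing comes out ahead by $1,000 for every p
    print(p, eu_one, eu_two)
```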
CDT doesn’t assign credences to outcomes in the way you are suggesting.
One way to think about it is as follows: basically, CDT says that you should use your prior probability of a state (not an outcome) and update this probability only in those cases where the decision being considered causally influences the state. So whatever prior credence you had in the “box contains $1M” state, given that the decision doesn’t causally influence the box contents, you should have that same credence regardless of your decision, and the same goes for the other state.
There are so many different ways of outlining CDT that I don’t intend to discuss why the above account doesn’t exactly describe each of these versions of CDT, but some equivalent of the answer above will apply to all such accounts.
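One way to make this state-based bookkeeping concrete, with the prior and the conditional credences chosen purely for illustration:

```python
# State: whether box B contains the $1M. CDT evaluates every act with the same
# (prior) credence in this state, because the act does not causally influence
# the contents; conditioning that credence on the act, as EDT does, changes it.
prior_full = 0.5                                       # illustrative prior credence
p_full_given_act = {"one-box": 0.99, "two-box": 0.01}  # illustrative conditionals

payoff = {("one-box", True): 1_000_000, ("one-box", False): 0,
          ("two-box", True): 1_001_000, ("two-box", False): 1_000}

def expected_utility(act, p_full):
    return p_full * payoff[(act, True)] + (1 - p_full) * payoff[(act, False)]

cdt = {act: expected_utility(act, prior_full) for act in ("one-box", "two-box")}
edt = {act: expected_utility(act, p_full_given_act[act]) for act in ("one-box", "two-box")}
print(cdt)  # two-box ranked higher (by exactly $1,000)
print(edt)  # one-box ranked higher
```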