Zack_M_Davis comments on 2013 Less Wrong Census/Survey

Zack_M_Davis 22 Nov 2013 3:34 UTC
33 points
For the Prize Question, you should use a random number generator and cooperate with probability 0.8. Why? Suppose that the fraction of survey-takers that cooperate is p. Then the value of the prize will be proportional to p and there will be p + 4(1 - p) raffle entries. The expected value of Cooperating is p/(p + 4(1-p)) and the expected value of Defecting is 4(1-p)/(p + 4(1-p)). In equilibrium, these must be the same: if one choice were more profitable than the other, then people would switch until this was no longer the case. Thus p = 4(1 - p) and thus p = ⁴⁄₅.

Addendum 29 November: Actually, this is wrong; see ensuing discussion.
What links here?
- Zack_M_Davis's comment on 2013 Survey Results by Scott Alexander (19 Jan 2014 0:47 UTC; 27 points)
- ThrustVectoring 22 Nov 2013 4:44 UTC
  15 points
  Parent
  The expected value of defecting is 4p/(p + 4(1-p), to within one part in the number of survey takers. Whether or not you defect makes no difference as to the proportion of people who defect.
  
  The solution is to determine how likely it is that a random participant is going to defect, conditional on your choice of cooperate or defect. If you’re playing with a total of N copies of yourself, you cooperate and get the maximal payout ($60/N). If you’re playing against cooperate bots, you defect and get $60*4N/(N-1).
  
  We can generalize this to partial levels. If you play with D defectors and C cooperators whose opinion you can’t change, and X people who will cooperate when you cooperate (and defect when you defect), then the payouts are as thus:
  
  C: (C + X)/(C + D + X) D: 4(C /(C + D + X)
  
  You can solve for the break even point by setting C + X = 4 * C
  
  So the answer is that you should defect, unless you think that for every person who is going to cooperate no matter what, there are at least three people who are thinking with similar enough reasoning to come up with the same answer you come up with (regardless of what answer that is).
  - Oscar_Cunningham 25 Nov 2013 22:56 UTC
    1 point
    Parent
    I think you’ve got the denominators of your fractions wrong. There are 4 raffle tickets for everyone who defects. I get the values
    
    C: (C + X)/(C + 4D + X) D: 4(C /(C + 4D + 4X)
    
    which solves to a horrible quadratic surd.
    
    If we wanted to we could combine your method with Zack’s and assume that C people cooperate, D defect and X make the same choice I do, which is to cooperate with probability p. I think this gets kinda ugly though.
    - ThrustVectoring 26 Nov 2013 0:44 UTC
      3 points
      Parent
      The fractions I wrote are payout * number of tickets, not the chance of winning. But you do have a point: changing many people from cooperate to defect does dilute the total pool of tickets, and not be an unnoticeable amount.
      
      The corrected answer is Payout * Chance to win, which is:
      
      C: (C + X)/(C + D + X) * (1 / (C + 4D + X)
      
      D: (C/(C+D+C)) * (4 / (C + 4D + 4X))
      
      And you don’t want to combine my method with Zack’s. You don’t want a probabilistic strategy—you want to figure out what your beliefs are as far as “how many people do I expect to be in categories C, X, and D”. Given your beliefs about how your choices affect others, there’s exactly one right choice.
      - Oscar_Cunningham 26 Nov 2013 16:05 UTC
        3 points
        Parent
        (By the way, the numbers I gave are the same as the ones you gave, only I cancelled a common factor of (C+D+X))
        
        And you don’t want to combine my method with Zack’s. You don’t want a probabilistic strategy—you want to figure out what your beliefs are as far as “how many people do I expect to be in categories C, X, and D”. Given your beliefs about how your choices affect others, there’s exactly one right choice.
        
        I think that your “one right choice” might sometimes be a probabilistic one. To make this more obvious, consider a game where the value of the prize is maximal when exactly half of the participants choose C, and the value goes down as the proportion gets further from a half (and any of the participants is equally likely to win the prize). Then I think it’s obvious that the correct strategy is to estimate C, D, and X as before, and then cooperate with probability p so that C+pX=D+(1-p)X. Then because everyone else in X acts as you do you’ll end up with exactly half the people choosing C, which is what you want.
        
        Note that even some of the people in X who you are “acausally controlling” still end up choosing a different option from you (assuming that your random number generators are independent). This allows you to exactly optimise the proportion of people who choose C, which is what makes the strategy work.
        
        I think the same thing applies in Yvain’s game. In particular, if we thought that C=D=0 then I think that Zack’s analysis is exactly correct (although I wouldn’t have used exactly the same words as he does).
        
        EDIT: I retract the last sentence. Zack’s calculation isn’t what you want to do even in the C=D=0 case. In that case I endorse cooperating with p=1. But I still think that mixed strategies are best in some of the cases with C or D non-zero. In particular what about the case with D=0 but C=X? Then I reckon you should pick C with p=0.724.
        Zack_M_Davis 30 Nov 2013 5:57 UTC
        6 points
        Parent
        I think this is it. Suppose there are C CooperateBots, D DefectBots, and X players who Cooperate with probability p. The expected utility of the probabilistic strategy is (proportional to) (p(C + pX) + 4(1-p)(C + pX))/(C + 4D + pX + 4(1-p)X). Then (he said, consulting his computer algebra system) if C/X < ¹⁄₃ then p = 1 (Cooperate), if C/X > 3 then p = 0 (Defect), and p assumes intermediate values if ¹⁄₃ < C/X < 3 (including 0.7239 if C/X = 1, as you mention).
        What links here?
        Zack_M_Davis's comment on 2013 Less Wrong Census/Survey by Scott Alexander (22 Nov 2013 3:34 UTC; 33 points)
  - hylleddin 22 Nov 2013 9:45 UTC
    0 points
    Parent
    
    The expected value of defecting is 4p/(p + 4(1-p), to within one part in the number of survey takers. Whether or not you defect makes no difference as to the proportion of people who defect.
    
    Unless you’re using timeless decision theory, if I understand TDT correctly (which I very well might not). In that case, the calculations by Zack show the amount of causal entanglement for which cooperation is a good choice. That is, P(others cooperate | I cooperate) and P(others defect | I defect) should be more than 0.8 for cooperation to be a good idea.
    
    I do not think my decisions have that level of causal entanglement with other humans, so I defected.
    
    Though, I just realized, I should have been basing my decision on my entanglement with lesswrong survey takers, which is probably substantially higher. Oh well.
    - Oscar_Cunningham 26 Nov 2013 16:40 UTC
      0 points
      Parent
      
      Though, I just realized, I should have been basing my decision on my entanglement with lesswrong survey takers, which is probably substantially higher. Oh well.
      
      I defected for the same reasons as you. We’re entangled! Reading the responses of the other survey takers I think it’s clear that very few people are entangled with us, so we did indeed make the right choice!
    - hylleddin 22 Nov 2013 9:55 UTC
      0 points
      Parent
      Nevermind, you already covered this, though in a different fashion.
      - ThrustVectoring 22 Nov 2013 15:30 UTC
        2 points
        Parent
        Yeah, and the math is a little different, three entangled decision makers for each cooperate-bot you can defect against (the number of defectors don’t matter, surprisingly). You get three extra chances to get the money generously donated to the pool by the cooperate bots by defecting, compared to causing a certain number of people to help you make the pool even larger.
- stevko 22 Nov 2013 12:03 UTC
  4 points
  Parent
  I think that way to get maximum reward is doing the survey (at least) four times and always answering cooperate.

Zack_M_Davis comments on 2013 Less Wrong Census/​Survey

Zack_M_Davis comments on 2013 Less Wrong Census/Survey