Eliezer Yudkowsky comments on Nash Equilibria and Schelling Points

Eliezer Yudkowsky 29 Jun 2012 15:52 UTC
26 points
0
It’s amazing, the results people come up with when they don’t use TDT (or some other formalism that doesn’t defect in the Prisoner’s Dilemma—though so far as I know, the concept of the Blackmail Equation is unique to TDT.)

(Because the base case of the pirate scenario is, essentially, the Ultimatum game, where the only reason the other person offers you $1 instead of $5 is that they model you as accepting a $1 offer, which is a very stupid answer to compute if it results in you getting only $1 - only someone who two-boxed on Newcomb’s Problem would contemplate such a thing.)
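A minimal sketch of the Ultimatum-game point above, assuming a pot of $10 (hypothetical; the comment only mentions the $1 and $5 offers) and a proposer who models the responder's minimum acceptable offer perfectly. The function names are illustrative only.

```python
def proposer_offer(total, responder_threshold):
    """Smallest offer the responder is modeled as accepting, or None."""
    if responder_threshold > total:
        return None  # no offer can satisfy this responder
    return responder_threshold


def payoffs(total, responder_threshold):
    """Return (proposer payoff, responder payoff)."""
    offer = proposer_offer(total, responder_threshold)
    if offer is None:
        return (0, 0)  # no deal, nobody gets anything
    return (total - offer, offer)


TOTAL = 10  # hypothetical pot size
for threshold in (1, 5):
    print(threshold, payoffs(TOTAL, threshold))
```

Under these assumptions, a responder modeled as accepting $1 ends up with $1, and one modeled as rejecting anything under $5 ends up with $5.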
- cousin_it 29 Jun 2012 16:10 UTC
  17 points
  0
  Parent
  At some point you proposed to solve the problem of blackmail by responding to offers but not to threats. Do you have a more precise version of that proposal? What logical facts about you and your opponent indicate that the situation is an offer or a threat? I had problems trying to figure that out.
  - [deleted] 29 Jun 2012 18:33 UTC
    11 points
    0
    Parent
    I have a possible idea for this, but I think I need help working out more of the rules for the logical scenario as well. All I have are examples (and it’s not like examples of a threat are that tricky to imagine).
    
    Person A makes situations that involve some form of request (an offer, a series of offers, a threat, etc.). Person B may either Accept, Decline, or Revoke Person A’s requests. Revoking a request blocks requests from occurring at all, at a cost.
    
    Person A might say “Give me 1 dollar and I’ll give you a Frozen Pizza.” And Person B might “Accept” if Frozen Pizza grants more utility than a dollar would.
    
    Person A might say “Give me 100 dollars and I’ll give you a Frozen Pizza.” Person B would “Decline” the offer, since Frozen Pizza probably wouldn’t be worth more than 100 dollars, but he probably wouldn’t bother to revoke it. Maybe Person A’s next situation will be more reasonable.
    
    Or Person A might say “Give me 500 dollars or I’ll kill you.” And Person B will pick “Revoke” because he doesn’t want that situation to occur at all. The fact that there is a choice between death or minus 500 dollars is not a good situation. He might also revoke future situations from that person.
    
    Alternate examples: If you’re trying to convince someone to go out on a date, they might say “Yes”, “No”, or “Get away from me, you creep!”
    
    If you are trying to enter a password to a computer system, they might allow access (correct password), deny access (incorrect password), or deny access and lock access attempts for some period (multiple incorrect passwords)
    
    Or if you’re at a receptionist desk:
    
    A: “I plan on going to the bathroom, Can you tell me where it is?”
    
    B: “Yes.”
    
    A: “I plan on going to a date tonight, Would you like to go out with me to dinner?”
    
    B: “No.”
    
    A: “I plan on taking your money, can you give me the key to the safe this instant?”
    
    B: “Security!”
    
    The difference appears to be that if it is a threat (or a fraud) you not only want to decline the offer, you want to decline future offers even if they look reasonable because the evidence from the first offer was that bad. Ergo, if someone says:
    
    A: “I plan on taking your money, can you give me the key to the safe this instant?”
    
    B: “Security!”
    
    A: “I plan on going to the bathroom, Can you tell me where it is?”
    
    B: (won’t say yes at this point because of the earlier threat) “SECURITY!”
    
    Whereas for instance, in the reception scenario the date isn’t a threat, so:
    
    A: “I plan on going to a date tonight, Would you like to go out with me to dinner?”
    
    B: “No.”
    
    A: “I plan on going to the bathroom, Can you tell me where it is?”
    
    B: “Yes.”
    
    I feel like this expresses threats or frauds clearly to me, but I’m not sure if it would be clear to someone else. Did it help? Are there any holes I need to fix?
    - Vaniver 29 Jun 2012 18:51 UTC
      22 points
      0
      Parent
      The doctor walks in, face ashen. “I’m sorry- it’s likely we’ll lose her or the baby. She’s unconscious now, and so the choice falls to you: should we try to save her or the child?”
      
      The husband calmly replies, “Revoke!”
      
      In non-story format: how do you formalize the difference between someone telling you bad news and someone causing you to be in a worse situation? How do you formalize the difference between accidental harm and intentional harm? How do you determine the value for having a particular resistance to blackmail, such that you can distinguish between blackmail you should and shouldn’t give in to?
      - A1987dM 1 Jul 2012 9:32 UTC
        4 points
        0
        Parent
        
        How do you determine the value for having a particular resistance to blackmail, such that you can distinguish between blackmail you should and shouldn’t give in to?
        
        The doctor has no obvious reason to prefer that you choose your wife over your child, or vice versa. On the other hand, the mugger would much rather you hand him your wallet than accept being killed, and so he’s deliberately making the latter possibility as unpleasant to you as possible to make you choose the former; but if you had precommitted to not choosing the former (e.g. by leaving your wallet at home) and he had known it, he wouldn’t have approached you in the first place.
        
        IOW this is the decision tree:
        
        +-- mug --------+-- give in --------- (+50, -50)
        |               |
        |               +-- don't give in --- (-1, -1e6)
        |
        +-- don't mug ----------------------- (0, 0)
        where the mugger makes the first choice, you make the second choice, and the numbers in parentheses are the pay-offs for the mugger and for you respectively. If you precommit not to choose the top branch, the mugger will take the bottom branch. (How do I stop multiple spaces from being collapsed into one?)
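        The tree can be checked by backward induction. This is a rough sketch using the pay-offs given above; the function names are made up for illustration, and it assumes the mugger knows whether you have precommitted.

```python
# Pay-offs are (mugger, victim), copied from the decision tree.
PAYOFFS = {
    ("mug", "give in"): (50, -50),
    ("mug", "don't give in"): (-1, -1_000_000),
    ("don't mug", None): (0, 0),
}


def victim_reply(precommitted):
    """The victim's move after being mugged."""
    if precommitted:
        return "don't give in"
    # Without a precommitment, pick the reply with the better victim pay-off.
    return max(("give in", "don't give in"),
               key=lambda reply: PAYOFFS[("mug", reply)][1])


def mugger_choice(precommitted):
    """The mugger anticipates the victim's reply and mugs only if it pays."""
    mug_value = PAYOFFS[("mug", victim_reply(precommitted))][0]
    stay_value = PAYOFFS[("don't mug", None)][0]
    return "mug" if mug_value > stay_value else "don't mug"


def outcome(precommitted):
    """Resulting (mugger, victim) pay-offs."""
    if mugger_choice(precommitted) == "mug":
        return PAYOFFS[("mug", victim_reply(precommitted))]
    return PAYOFFS[("don't mug", None)]


print(outcome(precommitted=False))  # (50, -50)
print(outcome(precommitted=True))   # (0, 0)
```

        With a known precommitment the mugger's best move is to stay away, which leaves you better off than giving in.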
      - [deleted] 29 Jun 2012 19:27 UTC
        3 points
        0
        Parent
        
        The doctor walks in, face ashen. “I’m sorry- it’s likely we’ll lose her or the baby. She’s unconscious now, and so the choice falls to you: should we try to save her or the child?” The husband calmly replies, “Revoke!”
        
        An eloquent way of pointing out what I was missing. Thank you!
        
        In non-story format: How do you formalize the difference between someone telling you bad information and someone causing you to be in a worse situation?
        
        I will try to think on this more. The only thing that’s occurred to me so far is that, if you have a formalization, it may not be a good idea to announce it. Someone who knows your formalization might be able to exploit it by customizing their imposed worse situation to look like simply telling you bad information, their intentional harm to look like accidental harm, or their blackmail to extort the maximum amount of money out of you, if they had an explicit set of formal rules about where those boundaries were.
        
        And for instance, it seems like a person would prefer that a blackmailer, out of caution, demand less than they could theoretically get away with, rather than having every blackmailer immediately demand the maximum effective amount. (at that point, since the threshold can change)
        
        Again, I really do appreciate you helping me focus my thoughts on this.
      - TheOtherDave 29 Jun 2012 19:33 UTC
        0 points
        0
        Parent
        If I have a choice of whether or not to perform an action A, and I believe that performing A will harm agent X and will not in and of itself benefit me, and I credibly commit to performing A unless X provides me with some additional value V, I would consider myself to be threatening X with A unless they provide V. Whether that is a threat of blackmail or some other kind of threat doesn’t seem like a terribly interesting question.
        
        Edit: my earlier thoughts on extortion/blackmail, specifically, here.
        Vaniver 29 Jun 2012 21:29 UTC
        6 points
        0
        Parent
        
        will not in and of itself benefit me
        
        Did you ever see Shawshank Redemption? One of the Warden’s tricks is not just to take construction projects with convict labor, but to bid on any construction project (with the ability to undercut any competitor because his labor is already paid for) unless the other contractors paid him to stay away from that job.
        
        My thought, as hinted at by my last question, is that refusing or accepting any particular blackmail request depends on the immediate and reputational costs of refusing or accepting. A flat “we will not accept any blackmail requests” is emotionally satisfying to deliver, but can’t be the right strategy for all situations. (When the hugger mugger demands “hug me or I’ll shoot!”, well, I’ll give him a hug.) A “we will not accept any blackmail requests that cost more than X” seems like the next best step, but as pointed out here that runs the risk of people just demanding X every time. Another refinement might be to publish an “acceptance function”: you’ll accept a (sufficiently credible and damaging) blackmail request for x with probability f(x), which is a decreasing (probably sigmoidal) function.
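        A minimal sketch of such an acceptance function, assuming a logistic shape. The names and the `midpoint` and `steepness` defaults are hypothetical, not taken from the comment; in practice they would depend on the credibility and severity of the threat.

```python
import math


def acceptance_probability(demand, midpoint=100.0, steepness=0.05):
    """Decreasing sigmoid f(x): probability of giving in to a demand of x."""
    z = steepness * (demand - midpoint)
    if z > 700:  # avoid overflow in math.exp for enormous demands
        return 0.0
    return 1.0 / (1.0 + math.exp(z))


def expected_take(demand, midpoint=100.0, steepness=0.05):
    """What a blackmailer can expect to collect from demanding x: x * f(x)."""
    return demand * acceptance_probability(demand, midpoint, steepness)


for x in (10, 100, 300):
    print(x, round(acceptance_probability(x), 3), round(expected_take(x), 2))
```

        Unlike a hard cutoff at X, the blackmailer's expected take no longer jumps at a single known threshold, although it still peaks somewhere, so the shape of f matters.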
        
        But the reputational costs of accepting or rejecting vary heavily based on the variety of threat, what you believe about potential threateners, whose opinions you care about, and so on. Things get very complex very fast.
        TheOtherDave 29 Jun 2012 22:54 UTC
        2 points
        0
        Parent
        If I am able to outbid all competitors for any job, but cannot do all jobs, and I let it be known that I won’t bid on jobs if bribed accordingly, I would not consider myself to be threatening all the other contractors, or blackmailing them. In effect this is a form of rent-seeking.
        
        The acceptance-function approach you describe, where the severity and credibility of the threat matter, makes sense to me.
        Vaniver 30 Jun 2012 1:02 UTC
        0 points
        0
        Parent
        Blackmail seems to me to be a narrow variety of rent-seeking, and reasons for categorically opposing blackmail seem like reasons for categorically opposing rent-seeking. But I might be using too broad a category for ‘rent-seeking.’
        TheOtherDave 30 Jun 2012 1:09 UTC
        3 points
        0
        Parent
        
        reasons for categorically opposing blackmail seem like reasons for categorically opposing rent-seeking
        
        Well, I agree, but only because in general the reasons for categorically opposing something that would otherwise seem rational to cooperate with are similar. That is, the strategy of being seen to credibly commit to a policy of never rewarding X, even when rewarding X would leave me better off, is useful whenever such a strategy reduces others’ incentive to X and where I prefer that people not X at me. It works just as well where X=rent-seeking as where X=giving me presents as where X=threatening me.
        
        Can you expand on your model of rent-seeking?
        Vaniver 30 Jun 2012 1:22 UTC
        0 points
        0
        Parent
        
        Can you expand on your model of rent-seeking?
        
        Yes, but I’m not sure how valuable it is to do so. Basically, it boils down to ‘non-productive means of acquiring wealth,’ but it’s not clear if, say, petty theft should be included. (Generally, definitional choices like that are made based on identity implications, rather than economic ones.) The general sentiment of “things I prefer that people not X at me” captures the essence better, perhaps.
        
        There are benefits to insisting on a narrower definition: perhaps something like legal non-productive means of acquiring wealth, but part of the issue is that rent-seeking often operates by manipulating the definition of ‘legal.’
  - bideup 9 Mar 2021 16:36 UTC
    1 point
    0
    Parent
    Here’s my version of the definition used by Schelling in The Strategy of Conflict: A threat is when I commit myself to an action, conditional on an action of yours, such that if I end up having to take that action I would have reason to regret having committed myself to it.
    So if I credibly commit myself to the assertion, ‘If you don’t give me your phone, I’ll throw you off this ship,’ then that’s a threat. I’m hoping that the situation will end with you giving me your phone. If it ends with me throwing you overboard, the penalties I’ll incur will be sufficient to make me regret having made the commitment.
    But when these rational pirates say, ‘If we don’t like your proposal, we’ll throw you overboard,’ then that’s not a threat; they’re just elucidating their preferences. Schelling uses ‘warning’ for this sort of statement.
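    One way to make that distinction concrete, as a rough sketch: compare what the committer gets from carrying out the conditional action with what they would get from dropping it once the condition is triggered. The function name and the example pay-offs are hypothetical.

```python
def classify_commitment(payoff_if_carried_out, payoff_if_dropped):
    """Threat if the committer would regret having to follow through."""
    if payoff_if_carried_out < payoff_if_dropped:
        return "threat"
    return "warning"


# Throwing you overboard over a phone costs me more than walking away.
print(classify_commitment(payoff_if_carried_out=-10, payoff_if_dropped=0))

# The pirates actually prefer voting down a proposal they dislike.
print(classify_commitment(payoff_if_carried_out=1, payoff_if_dropped=0))
```

    The first call prints "threat" and the second "warning", matching the two cases above.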
- raptortech97 12 Sep 2016 3:37 UTC
  6 points
  0
  Parent
  So if all pirates implement TDT, what happens?
- Pentashagon 29 Jun 2012 23:42 UTC
  −1 points
  0
  Parent
  I’ll guess that your analysis, given the base case of D and E’s game being a tie vote on a (D=100, E=0) split, results in a (C=0, D=0, E=100) split for three pirates, since E can blackmail C into giving up all the coins in exchange for staying alive? D may vote arbitrarily on a (C=0, D=100, E=0) split, so C must consider E to have the deciding vote.
  
  If so, that means four pirates would yield (B=0, C=100, D=0, E=0) or (B=0, C=0, D=100, E=0) in a tie. E expects 100 coins in the three-pirate game and so wouldn’t be a safe choice of blackmailer, but C and D expect zero coins in a three-pirate game so B could choose between them arbitrarily. B can’t give fewer than 100 coins to either C or D because they will punish that behavior with a deciding vote for death, and B knows this. It’s potentially unintuitive for C because C’s expected value in a three-pirate game is 0 but if C commits to voting against B for anything less than 100 coins, and B knows this, then B is forced to give either 0 or 100 coins to C. The remaining coins must go to D.
  
  In the case of five pirates C and D expect more than zero coins on average if A dies because B may choose arbitrarily between C or D as blackmailer. B and E expect zero coins from the four-pirate game. A must maximize the chance that two or more pirates will vote for A’s split. C and D have an expected value of 50 coins from the four-pirate game if they assume B will choose randomly, and so a (A=0, B=0, C=50, D=50, E=0) split is no better than B’s expected offer for C and D and any fewer than 50 coins for C or D will certainly make them vote against A. I think A should offer (A=0, B=n, C=0, D=0, E=100-n) where n is mutually acceptable to B and E.
  
  Because B and E have no relative advantage in a four-pirate game (both expect zero coins) they don’t have leverage against each other in the five-pirate game. If B had a non-zero probability of being killed in a four-pirate game then A should offer E more coins than B at a ratio corresponding to that risk. As it is, I think B and E would accept a fair split of n=50, but I may be overlooking some potential for E to blackmail B.
  - Brilliand 30 Sep 2015 22:08 UTC
    0 points
    0
    Parent
    In every case of the pirates game, the decision-maker assigns one coin to every pirate an even number of steps away from himself, and the rest of the coins to himself (with more gold than pirates, anyway; things can get weird with large numbers of pirates). See the Wikipedia article Kawoomba linked to for an explanation of why.
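    The rule can be checked by backward induction. This sketch assumes the usual textbook conventions, which are not spelled out in this comment: a proposal passes with at least half the votes (the proposer votes for it), and a pirate votes yes only if offered strictly more than he would get after the proposer dies. The function names are illustrative.

```python
def pirate_split(n_pirates, coins=100):
    """Split proposed by the most senior pirate; index 0 is the proposer."""
    split = [coins]  # a lone pirate keeps everything
    for n in range(2, n_pirates + 1):
        fallback = split  # what the other n-1 pirates get if the proposer dies
        votes_to_buy = (n + 1) // 2 - 1  # yes-votes needed besides the proposer's
        # Buy the cheapest votes: each costs one coin more than the fallback.
        cheapest = sorted(range(n - 1), key=lambda i: fallback[i])[:votes_to_buy]
        bribes = [0] * (n - 1)
        for i in cheapest:
            bribes[i] = fallback[i] + 1
        split = [coins - sum(bribes)] + bribes
    return split


def parity_rule(n_pirates, coins=100):
    """One coin to each pirate an even number of steps from the proposer."""
    others = [1 if i % 2 == 0 else 0 for i in range(1, n_pirates)]
    return [coins - sum(others)] + others


print(pirate_split(5))  # [98, 0, 1, 0, 1], i.e. A=98, B=0, C=1, D=0, E=1
print(all(pirate_split(n) == parity_rule(n) for n in range(1, 11)))  # True
```

    The two agree for small crews with 100 coins; the sketch does not handle the case where the coins run out.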