Prisoner’s Dilemma

TagLast edit: 30 Sep 2020 19:08 UTC by Ruby

The Prisoner’s Dilemma is a well-studied game in game theory, where supposedly rational incentive following leads to both players stabbing each other in the back and being worse off than if they had cooperated.

The original formulation, via Wikipedia:

Two members of a criminal gang are arrested and imprisoned. Each prisoner is in solitary confinement with no means of communicating with the other. The prosecutors lack sufficient evidence to convict the pair on the principal charge, but they have enough to convict both on a lesser charge. Simultaneously, the prosecutors offer each prisoner a bargain. Each prisoner is given the opportunity either to betray the other by testifying that the other committed the crime, or to cooperate with the other by remaining silent. The possible outcomes are:

If A and B each betray the other, each of them serves two years in prison

If A betrays B but B remains silent, A will be set free and B will serve three years in prison

If A remains silent but B betrays A, A will serve three years in prison and B will be set free

If A and B both remain silent, both of them will serve only one year in prison (on the lesser charge).

The “stay silent” option is generally called Cooperate, and the “betray” option is called Defect. The only Nash Equilibrium of the Prisoner’s Dilemma is both players defecting, even though each would prefer the cooperate/cooperate outcome.

Notice that it’s only if you treat the other player’s decision as completely independent from yours, if the other player defects, then you score higher if you defect as well, whereas if the other player cooperates, you do better by defecting. Hence Nash Equilibrium to defect (at least if the game is to be played only once), and indeed, this is what classical causal decision theory says. And yet—and yet, if only somehow both players could agree to cooperate, they would both do better than if they both defected. If the players are timeless decision agents, or functional decision theory agents, they can.

A popular variant is the Iterated Prisoner’s Dilemma, where two agents play the Prisoner’s Dilemma against each other a number of times in a row. A simple and successful strategy is called Tit for Tat—cooperate on the first round, then on subsequent rounds do whatever your opponent did on the last round.

External links

Prisoner’s dilemma (Stanford Encyclopedia of Philosophy)

See also

References

Drescher, Gary (2006). Good and Real. Cambridge: The MIT Press. ISBN 0262042339.

Introduction to Prisoners’ Dilemma

Scott Alexander30 Jun 2012 0:54 UTC

62 points

6 comments5 min readLW link

The True Prisoner’s Dilemma

Eliezer Yudkowsky3 Sep 2008 21:34 UTC

229 points

117 comments4 min readLW link

The Pavlov Strategy

sarahconstantin20 Dec 2018 16:20 UTC

273 points

14 comments4 min readLW link

(srconstantin.wordpress.com)

Prisoners’ Dilemma with Costs to Modeling

Scott Garrabrant5 Jun 2018 4:51 UTC

123 points

20 comments7 min readLW link

The Epistemic Prisoner’s Dilemma

MBlume18 Apr 2009 5:36 UTC

109 points

46 comments2 min readLW link

Cooperating with agents with different ideas of fairness, while resisting exploitation

Eliezer Yudkowsky16 Sep 2013 8:27 UTC

103 points

42 comments4 min readLW link

Classifying games like the Prisoner’s Dilemma

philh4 Jul 2020 17:10 UTC

109 points

28 comments6 min readLW link 1 review

(reasonableapproximation.net)

Contrite Strategies and The Need For Standards

sarahconstantin24 Dec 2018 18:30 UTC

131 points

5 comments4 min readLW link

(srconstantin.wordpress.com)

Robust Cooperation in the Prisoner’s Dilemma

orthonormal7 Jun 2013 8:30 UTC

120 points

147 comments7 min readLW link

Real World Solutions to Prisoners’ Dilemmas

Scott Alexander3 Jul 2012 3:25 UTC

73 points

88 comments7 min readLW link

Domain Theory and the Prisoner’s Dilemma: FairBot

Gurkenglas7 May 2021 7:33 UTC

22 points

5 comments2 min readLW link

Coordinating the Unequal Treaties

lsusr25 Nov 2021 10:47 UTC

34 points

4 comments2 min readLW link

2014 iterated prisoner’s dilemma tournament results

tetronian230 Sep 2014 21:23 UTC

95 points

57 comments6 min readLW link

Prisoner’s Dilemma Tournament Results

prase6 Sep 2011 0:46 UTC

152 points

171 comments11 min readLW link

Investigating Emergent Goal-Like Behavior in Large Language Models using Experimental Economics

phelps-sg5 May 2023 11:15 UTC

6 points

1 comment4 min readLW link

Project idea: an iterated prisoner’s dilemma competition/game

Adam Zerner26 Feb 2024 23:06 UTC

8 points

0 comments5 min readLW link

Blackmail, Nukes and the Prisoner’s Dilemma

Stuart_Armstrong10 Mar 2010 14:58 UTC

25 points

20 comments2 min readLW link

A Different Prisoner’s Dilemma

Serpent-Stare14 Apr 2018 15:54 UTC

9 points

1 comment5 min readLW link

A Case for Cooperation: Dependence in the Prisoner’s Dilemma

grantstenger17 Jun 2024 1:10 UTC

9 points

2 comments23 min readLW link

Game Theory without Argmax [Part 1]

Cleo Nardo11 Nov 2023 15:59 UTC

69 points

18 comments19 min readLW link

The Darwin Game—Rounds 1 to 2

lsusr11 Nov 2020 1:53 UTC

48 points

9 comments3 min readLW link

Specialized Labor and Counterfactual Compensation

philh14 Nov 2020 18:13 UTC

18 points

2 comments19 min readLW link

(reasonableapproximation.net)

Game Theory without Argmax [Part 2]

Cleo Nardo11 Nov 2023 16:02 UTC

31 points

14 comments13 min readLW link

The Mutant Game—Rounds 11 to 30

lsusr23 Nov 2020 9:20 UTC

5 points

2 comments5 min readLW link

The Mutant Game—Rounds 31 to 90

lsusr27 Nov 2020 21:05 UTC

18 points

1 comment1 min readLW link

Player of Games

Jacob Falkovich29 Aug 2018 21:26 UTC

65 points

3 comments12 min readLW link

Re-formalizing PD

cousin_it28 Apr 2009 12:10 UTC

32 points

63 comments2 min readLW link

[Question] A way to beat superrational/EDT agents?

Abhimanyu Pallavi Sudhir17 Aug 2020 14:33 UTC

5 points

13 comments1 min readLW link

Four levels of understanding decision theory

Max H1 Jun 2023 20:55 UTC

12 points

11 comments4 min readLW link

Announcing the 2014 program equilibrium iterated PD tournament

tetronian231 Jul 2014 12:24 UTC

38 points

63 comments1 min readLW link

Most Prisoner’s Dilemmas are Stag Hunts; Most Stag Hunts are Schelling Problems

abramdemski14 Sep 2020 22:13 UTC

177 points

36 comments10 min readLW link 3 reviews

The Darwin Game

lsusr9 Oct 2020 10:19 UTC

91 points

131 comments3 min readLW link

Fairness vs. Goodness

Eliezer Yudkowsky22 Feb 2009 20:22 UTC

15 points

21 comments1 min readLW link

The Darwin Game—Rounds 21-500

lsusr21 Nov 2020 0:58 UTC

27 points

13 comments2 min readLW link

Intuitions about utilities

mingyuan6 Feb 2021 0:12 UTC

32 points

3 comments4 min readLW link

Defending Functional Decision Theory

Heighn8 Feb 2022 14:58 UTC

6 points

10 comments11 min readLW link

The Calculus of Nash Equilibria

Heighn1 Apr 2022 14:40 UTC

4 points

0 comments2 min readLW link

The Platonist’s Dilemma: A Remix on the Prisoner’s.

James Camacho12 Apr 2022 3:49 UTC

5 points

2 comments5 min readLW link

FDT defects in a realistic Twin Prisoners’ Dilemma

SMK15 Sep 2022 8:55 UTC

38 points

1 comment26 min readLW link

Conditions for Superrationality-motivated Cooperation in a one-shot Prisoner’s Dilemma

Jim Buhler19 Dec 2022 15:00 UTC

24 points

4 comments5 min readLW link

Logical Line-Of-Sight Makes Games Sequential or Loopy

StrivingForLegibility19 Jan 2024 4:05 UTC

39 points

0 comments7 min readLW link

Reframing Acausal Trolling as Acausal Patronage

StrivingForLegibility23 Jan 2024 3:04 UTC

14 points

0 comments2 min readLW link

Game Theory and Society

Zero Contradictions5 Aug 2024 4:27 UTC

4 points

0 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)

A free to enter, 240 character, open-source iterated prisoner’s dilemma tournament

Isaac King9 Nov 2023 8:24 UTC

64 points

19 comments1 min readLW link

(manifold.markets)

Predictable Defect-Cooperate?

quetzal_rainbow18 Nov 2023 15:38 UTC

7 points

1 comment2 min readLW link

Prisoner’s Dilemma (with visible source code) Tournament

AlexMennen7 Jun 2013 8:30 UTC

73 points

236 comments2 min readLW link

Prisoner’s dilemma tournament results

AlexMennen9 Jul 2013 20:50 UTC

54 points

124 comments1 min readLW link

The continued misuse of the Prisoner’s Dilemma

SilasBarta23 Oct 2009 3:48 UTC

34 points

70 comments2 min readLW link

Paper: Iterated Prisoner’s Dilemma contains strategies that dominate any evolutionary opponent

mapnoterritory2 Jun 2012 20:50 UTC

39 points

19 comments2 min readLW link

Fixed-Length Selective Iterative Prisoner’s Dilemma Mechanics

Andreas_Giger13 Sep 2011 3:24 UTC

34 points

14 comments15 min readLW link

Prisoner’s Dilemma on game show Golden Balls

atorm21 Apr 2012 0:31 UTC

28 points

32 comments1 min readLW link

The Counterfactual Prisoner’s Dilemma

Chris_Leong21 Dec 2019 1:44 UTC

21 points

17 comments3 min readLW link

The Truly Iterated Prisoner’s Dilemma

Eliezer Yudkowsky4 Sep 2008 18:00 UTC

31 points

86 comments1 min readLW link

Prisoner’s Dilemma as a Game Theory Laboratory

prase25 Aug 2011 14:30 UTC

22 points

47 comments3 min readLW link

[LINK] Cantor’s theorem, the prisoner’s dilemma, and the halting problem

Qiaochu_Yuan30 Jun 2013 20:26 UTC

22 points

9 comments1 min readLW link

Newcomb’s Problem vs. One-Shot Prisoner’s Dilemma

Wei Dai7 Apr 2009 5:32 UTC

14 points

16 comments1 min readLW link

Reflexive Oracles and superrationality: prisoner’s dilemma

Stuart_Armstrong24 May 2017 8:34 UTC

14 points

5 comments4 min readLW link

Prisoner’s Dilemma vs the Afterlife

DataPacRat24 Sep 2013 16:59 UTC

19 points

69 comments2 min readLW link

Other prespective on resolving the Prisoner’s dilemma

Stuart_Armstrong4 Jun 2013 16:13 UTC

17 points

34 comments1 min readLW link

Another Iterated Prisoner’s Dilemma Tournament?

Andreas_Giger25 May 2012 14:16 UTC

14 points

23 comments1 min readLW link

The True Epistemic Prisoner’s Dilemma

MBlume19 Apr 2009 8:57 UTC

24 points

72 comments2 min readLW link

New prisoner’s dilemma and chicken tournament

benelliott14 Sep 2011 8:00 UTC

10 points

13 comments2 min readLW link

Agent-Simulates-Predictor Variant of the Prisoner’s Dilemma

Gram_Stone15 Dec 2015 7:17 UTC

11 points

34 comments2 min readLW link

Iterated Prisoner’s Dilemma in software patents

RolfAndreassen22 Jul 2013 20:22 UTC

6 points

8 comments2 min readLW link

Pavlov Generalizes

abramdemski20 Feb 2019 9:03 UTC

67 points

4 comments7 min readLW link

No comments.