Is that consistent with the scoring of iterated prisoners’ dilemma, or is it a different game? The goal of abstract games is to maximize one’s score at the end of the game (or in infinite games, maximize the average score per time across infinite time)
The expected score of a discounting defector with per-round discount fraction p versus a cooperate-then reciprocate player in the [3,4;1,2] matrix after n-1 rounds would be 4+sum_{r=1}{n}2rp.The expected score of a cooperate-then reciprocate player against the same opponent would be 3+sum_{r=1}{n}3rp.
A quick estimate says that for a p of .5, the two scores are the same over infinite time.
Is that consistent with the scoring of iterated prisoners’ dilemma, or is it a different game? The goal of abstract games is to maximize one’s score at the end of the game (or in infinite games, maximize the average score per time across infinite time)
The expected score of a discounting defector with per-round discount fraction p versus a cooperate-then reciprocate player in the [3,4;1,2] matrix after n-1 rounds would be 4+sum_{r=1}{n}2rp.The expected score of a cooperate-then reciprocate player against the same opponent would be 3+sum_{r=1}{n}3rp.
A quick estimate says that for a p of .5, the two scores are the same over infinite time.
It is, for the reasons you suggest, a different game.