Stuart_Armstrong comments on Game Theory of the Immortals

Stuart_Armstrong 11 Mar 2013 18:13 UTC
6 points
0
The discount factor can mess things up—you’ll meet someone again, but after how long?
- Crystalist 11 Mar 2013 18:36 UTC
  0 points
  0
  Parent
  I’m not sure I see your point. My reasoning was that if you meet the same person on average every thousand games in an infinite series of games, you’ll end up meeting them an infinite number of times. Am I confusing the sample space with the event space?
  - Stuart_Armstrong 11 Mar 2013 18:45 UTC
    4 points
    0
    Parent
    If you have a strong discount factor, then even if you meet the same person infinitely often, your gain is still bounded above (summing a geometric series), and can be much smaller than winning your current round.
    - Crystalist 11 Mar 2013 18:53 UTC
      4 points
      0
      Parent
      face-palm Ah yes. Thanks.
    - Decius 11 Mar 2013 20:14 UTC
      0 points
      0
      Parent
      How can R/(1-p) diminish when R and p are constant? Are you discounting future games as worth less than this game, and is that consistent with the scoring of iterated prisoner’s dilemma?
      - Stuart_Armstrong 12 Mar 2013 11:04 UTC
        0 points
        0
        Parent
        
        Are you discounting future games as worth less than this game
        
        Yes, that’s what discounting does. If you have a discounted iterated PD, you have to do something like that. And it R/(1-p) is smaller than profiteering in your current interaction, you’ll profiteer in your current action.
        Decius 14 Mar 2013 2:30 UTC
        2 points
        0
        Parent
        Is that consistent with the scoring of iterated prisoners’ dilemma, or is it a different game? The goal of abstract games is to maximize one’s score at the end of the game (or in infinite games, maximize the average score per time across infinite time)
        
        The expected score of a discounting defector with per-round discount fraction p versus a cooperate-then reciprocate player in the [3,4;1,2] matrix after n-1 rounds would be 4+ $sum_{r=1}{n}2rp$ .The expected score of a cooperate-then reciprocate player against the same opponent would be 3+ $sum_{r=1}{n}3rp$ .
        
        A quick estimate says that for a p of .5, the two scores are the same over infinite time.
        wedrifid 14 Mar 2013 2:56 UTC
        0 points
        0
        Parent
        
        Is that consistent with the scoring of iterated prisoners’ dilemma, or is it a different game?
        
        It is, for the reasons you suggest, a different game.