Looking ahead multiple moves seems sufficient to break the equilibrium, were it not for the stated assumption that the other players also have deeply flawed models of your behavior, models that assume you’re using a different strategy (the shared one, including punishment).
There does seem to be something fishy/circular about baking an assumption about the other players’ strategies into the player’s own strategy and omitting any ability to update.