Dagon comments on Buridan’s ass in coordination games

Dagon Jul 16, 2018, 7:48 PM
2 points
AF
R∼Uniform([0,1])
How can it possibly matter whether R is chosen before or after uy? R is completely independent of u, right? It’s not a covert communication mechanism about the players’ observations, it’s a random value.
- jessicata Jul 16, 2018, 7:51 PM
  LW: 1 AF: 1
  AF Parent
  If $u_{y}$ is chosen after $R$ then it might be chosen to depend on $R$ in such a way that the algorithm gets bad performance, e.g. using the method in the proof of Claim 1.
  - Dagon Jul 17, 2018, 5:04 PM
    LW: 4 AF: 1
    AF Parent
    Based on other comments, I realize I’m making an assumption for something you haven’t specified. How is uy chosen? If it’s random and independent, then my assertion holds, if it’s selected by an adversary who knows the players’ full strategies somehow, then R is just a way of keeping a secret from the adversary—sequence doesn’t matter, but knowledge does.
    - jessicata Jul 17, 2018, 8:21 PM
      LW: 0 AF: 1
      AF Parent
      Claim 1 says there exists some $u_{y}$ value for which the algorithm gets high regret, so we might as well assume it’s chosen to maximize regret.
      Claim 2 says the algorithm has low regret regrardless of $u_{y}$ , so we might as well assume it’s chosen to maximize regret.
  - Dagon Jul 16, 2018, 7:59 PM
    LW: 0 AF: -1
    AF Parent
    uy and R are independently chosen from well-defined distributions. Regardless of sequence, neither knows the other and CANNOT be chosen based on the other. I’ll see if I can find time tonight to figure out whether I’m saying your claim 1 is wrong (it dropped epsilon too soon from the floor value, but I’m not sure if it’s more fundamentally problematic than that) or that your claim 2 is misleading.
    My current expectation is that I’ll find that your claim 2 results are available in situation 1, by using your given function with a pre-agreed value rather than a random one.
    - Rohin Shah Jul 17, 2018, 12:49 AM
      LW: 4 AF: 2
      AF Parent
      The theorems are of the form “For all uy, you get good outcomes” or “There exists a uy that causes bad outcomes”.
      When you want to prove statements of this form, uy is chosen adversarially, so it matters whether it is chosen before or after R.
      uy and R are independently chosen from well-defined distributions.
      What distribution is uy chosen from? That’s not specified anywhere in the post.
    - Laszlo_Treszkai Jul 17, 2018, 7:02 AM
      1 point
      AF Parent
      True, they will fail to cooperate for some R, but the values of such R have a low probability. (But yeah, it’s also required that uy and R are chosen independently—otherwise an adversary could just choose either so that it results in the players choosing different actions.)
      
      The smoothness comes in from marginalising a random R. The coordination comes from making R and ε common knowledge, so they cooperate using the correlation in their observations—an interesting phenomenon.
      
      (How can I write LaTeX in the comments?)
      - jessicata Jul 17, 2018, 7:53 AM
        LW: 1 AF: 1
        AF Parent
        (How can I write LaTeX in the comments?)
        ctrl-4