StrivingForLegibility comments on Game Theory without Argmax [Part 2]

StrivingForLegibility 23 Nov 2023 7:49 UTC
2 points
−3
Edit: Cleo Nardo has confirmed that they intended $\prod_{i \in I}$ to mean the Cartesian product of sets, the ordinary meaning of that symbol in this context. I misunderstood the semantics of what $B(x)$ was intended to represent. I’ve updated my implementation to use the intended Cartesian product when calculating the best-response function. The rest of this comment is my initial (wrong) interpretation of $B(x)$.
I needed to go back to one of the papers cited in Part 1 to understand what that $\prod_{i \in I}$ was doing in that expression. I found the answer in “A Generalization of Nash’s Theorem with Higher-Order Functionals”. I’m going to do my best to paraphrase Hedges’ notation into Cleo’s notation, to avoid confusion.
TLDR: $B(x) \in P(X)$ is the set of option-profiles that are simultaneously best-responses by all players to the option-profile $x$. It is computed by considering, for each player, the option-profiles that can result from that player best-responding, then taking the intersection of those sets.
On page 6, Hedges defines the best-response correspondence $B \in X \to P(X)$ by
$B(x) = \bigcap_{i \in I} B_{i}(x)$
Where
$B_{i} \in X \to P (X)$
Hedges builds up the idea of Nash equilibria using quantifiers (like $\max$) rather than optimizers (like $\operatorname{argmax}$), but I believe the approaches are equivalent. Unpacking $B : x \mapsto \prod_{i \in I} ψ_{i}(g \circ U_{i}(x))$ from the inside out:
$U_{i} (x) \in X_{i} \to X$
$g \circ U_{i} (x) \in X_{i} \to R$
That makes $g \circ U_{i}(x)$ a $ψ_{i}$-task. Since $ψ_{i} \in (X_{i} \to R) \to P(X_{i})$, we know that $ψ_{i}(g \circ U_{i}(x)) \in P(X_{i})$.
This is where I had to go looking through papers. What sort of product takes a set of best-responses from each player, relative to a given option-profile, and returns a set of option-profiles that are simultaneously regarded by each player as a best-response? I thought about just taking the Cartesian product of the sets, but that wouldn’t get us only the mutual best-responses.
Let’s call the way that each player maps option-profiles to best-responses $b_{i} \in X \to P(X_{i})$. These are exactly the sets we want to take the product of:
$b_{i} (x) = ψ_{i} (g \circ U_{i} (x))$
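A minimal sketch of $b_{i}$ in TypeScript, assuming finite option sets and taking argmax of a numeric utility as one possible choice of $ψ_{i}$. The names `Optimiser`, `argmaxOver`, and `bestResponses`, and the toy game at the bottom, are hypothetical illustrations and not taken from the article or from my actual implementation.

```typescript
type Profile<A> = readonly A[];
// ψ_i ∈ (X_i → R) → P(X_i): an optimiser maps a task to a set of acceptable options.
type Optimiser<A, R> = (task: (alpha: A) => R) => A[];

// One possible ψ_i (an assumption): argmax of a numeric utility over a finite X_i.
function argmaxOver<A, R>(options: readonly A[], utility: (r: R) => number): Optimiser<A, R> {
  return (task) => {
    const scores = options.map((alpha) => utility(task(alpha)));
    const best = Math.max(...scores);
    return options.filter((_, k) => scores[k] === best);
  };
}

// b_i(x) = ψ_i(g ∘ U_i(x)), where U_i(x)(α) is x with entry i replaced by α.
function bestResponses<A, R>(
  psi_i: Optimiser<A, R>,
  g: (x: Profile<A>) => R,
  i: number,
  x: Profile<A>
): A[] {
  return psi_i((alpha) => g(x.map((xj, j) => (j === i ? alpha : xj))));
}

// Hypothetical game: the outcome is the sum of the chosen numbers,
// and player 0 prefers larger outcomes.
const g = (x: Profile<number>): number => x.reduce((sum, v) => sum + v, 0);
const psi0 = argmaxOver<number, number>([1, 2, 3], (r) => r);
const b0 = bestResponses(psi0, g, 0, [1, 2]); // [3]
```

Passing the option set into `argmaxOver` is a design choice made so that the optimiser has something to enumerate; the type $ψ_{i} \in (X_{i} \to R) \to P(X_{i})$ itself leaves that implicit.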
Hedges introduces notation on page 3 to handle the operation of taking an option-profile, varying one player’s option, and leaving the rest the same. Paraphrasing, Hedges defines $x(i \mapsto α) \in \prod_{j \in I} X_{j}$ by
$x(i \mapsto α)_{j} = \begin{cases} α & \text{if } i = j \\ x_{j} & \text{otherwise} \end{cases}$
You can read $x (i \mapsto α)$ as “give me a new copy of $x$ , where the $i$ th entry has been set to the value $α$ .” Hedges uses this to define the deviation maps equivalently to the way Cleo did. $U_{i} : X \to (X_{i} \to X)$
$U_{i} (x) (α) = x (i \mapsto α)$
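The deviation notation translates almost directly into code. This is a small sketch, assuming players are indexed `0..n-1` and a profile is stored as an array; `deviate` and `U` are names chosen here for illustration.

```typescript
// An option-profile assigns one option to each player; players are indexed 0..n-1.
type Profile<A> = readonly A[];

// x(i ↦ α): a copy of x whose i-th entry has been replaced by alpha.
function deviate<A>(x: Profile<A>, i: number, alpha: A): Profile<A> {
  return x.map((xj, j) => (j === i ? alpha : xj));
}

// U_i(x) ∈ X_i → X: the deviation map, curried so that U(i, x) is itself a function.
function U<A>(i: number, x: Profile<A>): (alpha: A) => Profile<A> {
  return (alpha) => deviate(x, i, alpha);
}

const x: Profile<string> = ["C", "D"];
const y = U(0, x)("D"); // ["D", "D"]; x itself is left unchanged
```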
The correspondences $B_{i} \in X \to P(X)$ take an option-profile as input and return the set of option-profiles which are player $i$’s optimal unilateral deviations from that option-profile. To construct $B_{i}$ from $b_{i}$, we want to map $b_{i}(x) \in P(X_{i})$ to the option-profiles which deviate from $x$ in those exact ways.
$B_{i}(x) = \{x(i \mapsto α) : α \in b_{i}(x)\}$
We can then use Hedges’ $B(x) = \bigcap_{i \in I} B_{i}(x)$ to get the best-response correspondence! We can unpack this to get a definition of $B$ using objects that Cleo defined, using that deviation notation from Hedges:
$B(x) = \bigcap_{i \in I} \{x(i \mapsto α) : α \in ψ_{i}(g \circ U_{i}(x))\}$
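Here is a rough sketch of this intersection-based reading, which is the interpretation described in this comment. Each $b_{i}$ is passed in as a plain function; the names `BestResponse`, `Bi`, `B`, and `alwaysD` are hypothetical, and profiles are compared via `JSON.stringify`, which assumes the options are JSON-serialisable.

```typescript
type Profile<A> = readonly A[];
type BestResponse<A> = (x: Profile<A>) => A[]; // b_i ∈ X → P(X_i)

// Structural equality for profiles of JSON-serialisable options.
const same = <A>(p: Profile<A>, q: Profile<A>): boolean =>
  JSON.stringify(p) === JSON.stringify(q);

// B_i(x) = { x(i ↦ α) : α ∈ b_i(x) }
function Bi<A>(i: number, b_i: BestResponse<A>, x: Profile<A>): Profile<A>[] {
  return b_i(x).map((alpha) => x.map((xj, j) => (j === i ? alpha : xj)));
}

// B(x) = ⋂_i B_i(x)
function B<A>(bs: readonly BestResponse<A>[], x: Profile<A>): Profile<A>[] {
  if (bs.length === 0) return [];
  return bs
    .map((b_i, i) => Bi(i, b_i, x))
    .reduce((acc, s) => acc.filter((p) => s.some((q) => same(p, q))));
}

// Hypothetical game in which "D" is each player's best response to every profile.
const alwaysD: BestResponse<string> = () => ["D"];
const atDD = B([alwaysD, alwaysD], ["D", "D"]); // [["D", "D"]]
const atCC = B([alwaysD, alwaysD], ["C", "C"]); // []
```

With two or more players, the result is either empty or contains only $x$ itself, because each $B_{i}(x)$ only contains profiles that differ from $x$ in entry $i$ at most.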
Thank you Cleo for writing this article! This was my first introduction to Higher-Order Game Theory, and I wrote up an implementation in TypeScript to help me understand how all of the pieces fit together!
- rotatingpaguro 23 Nov 2023 18:15 UTC
  2 points
  −1
  Parent
  I’m weirded out by this. To look at everything together, I write the original expression, and your expression rewritten using the OP’s notation:
  Original: $B : x \mapsto \prod_{i \in I} ψ_{i}(g \circ U_{i}(x))$
  Yours: $B(x) = \bigcap_{i \in I} \{x(i \mapsto α) : α \in ψ_{i}(g \circ U_{i}(x))\} = \bigcap_{i \in I} U_{i}(x)(ψ_{i}(g \circ U_{i}(x)))$
  (I’m using the notation that a function applied to a set is the image of that set.)
  So the big pi symbol stands for
  $\prod_{i \in I} A_{i} = \bigcap_{i \in I} U_{i}(x)(A_{i})$
  So it’s not a standalone operator: it’s context-dependent because it pops out an implicit $x$. The OP otherwise gives the impression of a more functional mindset, so I suspect the OP may mean something different from your guess.
  Another problem with your interpretation: it yields the empty set unless all agents consider doing nothing an option. The only possible non-empty output is $\{x\}$. Reason: each set you are intersecting contains tuples whose elements all equal those of $x$, except for one. So the intersection will necessarily contain only tuples with all elements equal to those in $x$.
  - StrivingForLegibility 24 Nov 2023 2:59 UTC
    2 points
    −6
    Parent
    Edit: Cleo Nardo has confirmed that they intended $\prod_{i \in I}$ to mean the Cartesian product of sets, the ordinary meaning of that symbol in this context. I misunderstood the semantics of what $B(x)$ was intended to represent. I’ve updated my implementation to use the intended Cartesian product when calculating the best-response function. The rest of this comment is based on my initial (wrong) interpretation of $B(x)$.
    I write the original expression, and your expression rewritten using the OP’s notation:
    Original: $B : x \mapsto \prod_{i \in I} ψ_{i}(g \circ U_{i}(x))$
    Yours: $B(x) = \bigcap_{i \in I} \{x(i \mapsto α) : α \in ψ_{i}(g \circ U_{i}(x))\} = \bigcap_{i \in I} U_{i}(x)(ψ_{i}(g \circ U_{i}(x)))$
    (I’m using the notation that a function applied to a set is the image of that set.)
    This is a totally clear and valid rewriting using that notation! My background is in programming and I spent a couple minutes trying to figure out how mathematicians write “apply this function to this set.”
    I believe the way that $B (x)$ is being used is to find Nash equilibria, using Cleo’s definition 6.5:
    Like before, the $Ψ$ -nash equilibria of $g$ is the set of option-profiles $x \in X$ such that $x \in B (x)$ .
    These are going to be option-profiles where “not deviating” is considered optimal by every player simultaneously. I agree with your conclusion that this leads $B(x)$ to take on values that are either $\{\}$ or $\{x\}$. When $B(x) = \{\}$, this indicates that $x$ is not a Nash equilibrium. When $B(x) = \{x\}$, we know that $x$ is a Nash equilibrium.
    - rotatingpaguro 24 Nov 2023 20:30 UTC
      1 point
      1
      Parent
      Oh I see now, $B$ just needs to work to pinpoint Nash equilibria. I did not make that connection.
      But anyway, the reason I’m suspicious of your interpretation is not that your math is incorrect, but that it makes the OP’s notation so unnatural. The unnatural things are:
      $\prod$ being context-dependent.
      $\prod$ not having its standard meaning.
      $U_{i}$ used implicitly instead of explicitly, when later it takes on a more important role to change decision theory.
      Using $x \in B(x)$ as the condition without mentioning that already $B(x) \neq \emptyset \iff x \text{ is Nash}$ if $|I| \geq 2$.
      So I guess I will stay in doubt until the OP confirms “yep I meant that”.
      - Cleo Nardo 25 Nov 2023 0:35 UTC
        4 points
        1
        Parent
        $B (x) \neq \emptyset$ isn’t equivalent to $x$ being Nash.
        Suppose Alice and Bob are playing prisoner’s dilemma. Then the best-response function of every option-profile is nonempty. But only one option-profile is Nash.
        $x \in B (x)$ is equivalent to $x$ being Nash.
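        The prisoner’s dilemma point can be checked with a small sketch that uses the Cartesian-product reading of $B$. The payoff numbers below are illustrative assumptions (any standard prisoner’s dilemma payoffs would do), and the names `payoff`, `bestResponses`, `B`, and `isNash` are hypothetical.

        ```typescript
        type Option = "C" | "D";
        type Profile = readonly [Option, Option];

        const options: readonly Option[] = ["C", "D"];

        // Illustrative prisoner's dilemma payoffs (assumed here, not given in the thread).
        function payoff(x: Profile): readonly [number, number] {
          if (x[0] === "C") return x[1] === "C" ? [2, 2] : [0, 3];
          return x[1] === "C" ? [3, 0] : [1, 1];
        }

        // b_i(x): player i's payoff-maximising options, holding the other player's option fixed.
        function bestResponses(i: 0 | 1, x: Profile): Option[] {
          const score = (alpha: Option): number => {
            const y: Profile = i === 0 ? [alpha, x[1]] : [x[0], alpha];
            return payoff(y)[i];
          };
          const best = Math.max(...options.map(score));
          return options.filter((alpha) => score(alpha) === best);
        }

        // B(x) = b_0(x) × b_1(x), the Cartesian product of the players' best-response sets.
        function B(x: Profile): Profile[] {
          const out: Profile[] = [];
          for (const a of bestResponses(0, x)) {
            for (const b of bestResponses(1, x)) out.push([a, b]);
          }
          return out;
        }

        // x is Nash exactly when x ∈ B(x).
        const isNash = (x: Profile): boolean =>
          B(x).some((p) => p[0] === x[0] && p[1] === x[1]);

        const all: Profile[] = [["C", "C"], ["C", "D"], ["D", "C"], ["D", "D"]];
        const everyBNonEmpty = all.every((x) => B(x).length > 0); // true
        const nash = all.filter(isNash); // [["D", "D"]]
        ```

        Under these assumed payoffs, $B(x) = \{(D, D)\}$ for every $x$, so $B(x)$ is never empty, yet only $(D, D)$ satisfies $x \in B(x)$.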