In a game with any finite number of players, and any finite number of actions per player.
Let O=A1×A2×... the set of possible outcomes.
Player i implements policy Pi:P(O)→Ai . For each outcome in o∈O , each player searches for proofs (in PA) that the outcome is impossible. It then takes the set of outcomes it has proved impossible, and maps that set to an action.
There is always a unique action that is chosen. Whatsmore, given oracles for
Qi(U)=∪U⊆VPi(V):P(O)→P(Ai)
Ie the set of actions you might take if you can prove at least the impossility results in U and possibly some others.
Given such an oracle Qi for each agent, there is an algorithm for their behaviour that outputs the fixed point in polynomial (in |O| ) time.
In a game with any finite number of players, and any finite number of actions per player.
Let O=A1×A2×... the set of possible outcomes.
Player i implements policy Pi:P(O)→Ai . For each outcome in o∈O , each player searches for proofs (in PA) that the outcome is impossible. It then takes the set of outcomes it has proved impossible, and maps that set to an action.
There is always a unique action that is chosen. Whatsmore, given oracles for
Qi(U)=∪U⊆VPi(V):P(O)→P(Ai)Ie the set of actions you might take if you can prove at least the impossility results in U and possibly some others.
Given such an oracle Qi for each agent, there is an algorithm for their behaviour that outputs the fixed point in polynomial (in |O| ) time.