One idea I take out of your definition of A is that it’s potentially useful to think about decisions as giving “strategies” that respond not just to “observations” (that were “withheld” during a decision), but also to “observations about utility”, that could factor out the differences between different agents and allow them to coordinate, the way a single agent can coordinate with its copies or counterfactual variants by deciding on a whole strategy instead of just a single action. Not sure if (or how) that works for PD though...
One idea I take out of your definition of A is that it’s potentially useful to think about decisions as giving “strategies” that respond not just to “observations” (that were “withheld” during a decision), but also to “observations about utility”, that could factor out the differences between different agents and allow them to coordinate, the way a single agent can coordinate with its copies or counterfactual variants by deciding on a whole strategy instead of just a single action. Not sure if (or how) that works for PD though...