We can model events (such as states of a program) with posets, related to each other by monotone maps. By beliefs I mean such posets or their elements (belief states). A state of an event can be enacted by an agent if the agent can bring the event into that state. So if the agent decides on an action, that action can be enacted. But if instead the agent only decides on things like beliefs about actions (elements of the powerset of the set of possible actions), these can’t be enacted: the agent can’t ensure that the action is {C, D} or ⊥; that doesn’t make sense. But beliefs about future states of the agent’s program that are themselves belief states, modeled as themselves, can be enacted by the agent, and that makes them an excellent target for decision making. I think this is the important takeaway from the modal decisions setting, but the setting lacks the component of choosing between possible solutions according to preference; instead it manipulates the pseudoenvironment between beliefs and actions to get something useful out of its incomplete decision-making machinery.
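To make the powerset example concrete, here’s a toy rendering (mine, with invented names, not anything from the setting itself): the belief states about a two-action game, ordered by inclusion, where only the singletons correspond to anything the agent can actually bring about.

```haskell
import qualified Data.Set as Set

data Action = C | D deriving (Eq, Ord, Show)

-- A belief state about the action: the set of actions still considered possible.
type ActionBelief = Set.Set Action

bottom, justC, justD, top :: ActionBelief
bottom = Set.empty             -- ⊥: contradictory belief, rules everything out
justC  = Set.fromList [C]      -- "the action is C"
justD  = Set.fromList [D]      -- "the action is D"
top    = Set.fromList [C, D]   -- no information: either action is possible

-- b1 is at least as strong as b2 when it allows no more possibilities.
atLeastAsStrongAs :: ActionBelief -> ActionBelief -> Bool
atLeastAsStrongAs b1 b2 = b1 `Set.isSubsetOf` b2

-- The agent can enact exactly the belief states that correspond to an action
-- it can actually take; {C, D} and ⊥ don't correspond to anything it can do.
enactable :: ActionBelief -> Bool
enactable b = Set.size b == 1
```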
We could say that a one-query player “decides to defect” if his query is proven false.
My point is that in principle, even for a belief whose set of states is moderately large (which the picture in the first comment of this thread gestures at), the action may be defined to depend on the belief state in an arbitrary way, perhaps switching back and forth between C and D as the belief gets stronger and stronger. That is because the action doesn’t play a fundamental role in the decision making; only belief does (in this setting, statements with proofs). But we are not making use of the ability to choose which things have proofs according to preference, so there’s this whole thing about carefully choosing how actions depend on beliefs, which doesn’t work very well.
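As a toy illustration of the switching (my own sketch, with a made-up stand-in for belief strength): nothing in the setting forces the action to respect the order on beliefs, so it may flip with every strengthening.

```haskell
data Action = C | D deriving (Eq, Show)

-- Crudely measure the "strength" of a belief by how many relevant statements
-- have been proven so far (a stand-in for walking up the poset of belief states).
type BeliefStrength = Int

-- Nothing requires this map to be monotone or otherwise well behaved: as the
-- belief gets strictly stronger, the chosen action alternates between C and D.
actionFromBelief :: BeliefStrength -> Action
actionFromBelief n = if even n then C else D
```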
Yeah, enacting comes in at a higher level of interpretation than is yet considered here.
The increasing levels of interpretation here are: set theory or other math foundations; we consider sets of queries and beliefs and players with functions between them; we add a partial order and monotonicity; we specify proof engines and their properties like consistency; we define utilities, decision theories, and what makes some players better than others.
(Category theory is good at keeping these separate.)
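For concreteness, a small generic sketch (mine, with invented names) of the “add a partial order and monotonicity” level:

```haskell
-- A poset interface, plus a brute-force check that a map between two finite
-- posets is monotone (preserves the order relation on the listed elements).
class Poset a where
  leq :: a -> a -> Bool

isMonotone :: (Poset a, Poset b) => [a] -> (a -> b) -> Bool
isMonotone xs f = and [ f x `leq` f y | x <- xs, y <- xs, x `leq` y ]

-- Example instance: Bool as the two-element poset False ≤ True.
instance Poset Bool where
  leq x y = not x || y
```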
I’d start talking about “enacting” when we define a decision theory like “Make the decision such that I can prove the best lower bound on utility”.
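Roughly, a sketch of what that rule could look like (my own reading, with invented names; a real version would enumerate proofs rather than search a fixed list of candidate bounds):

```haskell
import Data.List (maximumBy)
import Data.Ord (comparing)

data Decision = Cooperate | Defect deriving (Eq, Show)

-- Stand-in for a proof engine: given a decision and a candidate bound u, does
-- it find a proof of "if I make this decision, utility >= u"?
type ProofEngine = Decision -> Rational -> Bool

-- Best provable lower bound among the candidate bounds, with 0 as an arbitrary
-- fallback when nothing is provable.
bestProvableBound :: ProofEngine -> [Rational] -> Decision -> Rational
bestProvableBound proves candidates d =
  maximum (0 : [ u | u <- candidates, proves d u ])

-- The decision rule: pick the (assumed non-empty) decision with the best
-- provable lower bound on utility.
decide :: ProofEngine -> [Rational] -> [Decision] -> Decision
decide proves candidates =
  maximumBy (comparing (bestProvableBound proves candidates))
```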
What do you mean by deciding on a belief state? “Decision” is defined before I establish any causation from decisions to beliefs.
Oh, I thought you meant you didn’t see why any two beliefs had an upper bound. My choice to make players monotonic comes from the intuition that that’s how the math is supposed to look. I’d define Query = P(Decision) as Decision -> 2 as well, but that plainly makes no sense, so I’m looking for the true posetty definition of Query, and “logical formulas” looks good so far. Switching back and forth sounds more like you want to make multiple decisions, one after the other. There’s also a more grounded case to be made that your policy should become more certain as your knowledge does; do you see it?
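For what it’s worth, one very crude way the “logical formulas” option could be made concrete (my own propositional toy, with made-up atoms; nothing like the real proof-engine setting) is formulas preordered by entailment, with a poset obtained by quotienting out logical equivalence:

```haskell
import Data.List (nub)

-- Queries as propositional formulas over atomic statements.
data Formula
  = Atom String              -- e.g. "I defect", "opponent's query is proven"
  | FTrue | FFalse
  | And Formula Formula
  | Or  Formula Formula
  | Not Formula
  deriving (Eq, Show)

-- Truth of a formula under a valuation of the atoms.
eval :: (String -> Bool) -> Formula -> Bool
eval v (Atom a)  = v a
eval _ FTrue     = True
eval _ FFalse    = False
eval v (And p q) = eval v p && eval v q
eval v (Or  p q) = eval v p || eval v q
eval v (Not p)   = not (eval v p)

atomsOf :: Formula -> [String]
atomsOf (Atom a)  = [a]
atomsOf (And p q) = atomsOf p ++ atomsOf q
atomsOf (Or  p q) = atomsOf p ++ atomsOf q
atomsOf (Not p)   = atomsOf p
atomsOf _         = []

-- All valuations of a finite list of atoms (brute force; fine for a toy).
valuations :: [String] -> [String -> Bool]
valuations []     = [const False]
valuations (a:as) =
  [ \x -> if x == a then b else v x | v <- valuations as, b <- [False, True] ]

-- One natural order on queries: p ≤ q when p entails q.  This is only a
-- preorder on syntax; quotienting by logical equivalence gives an honest poset.
entails :: Formula -> Formula -> Bool
entails p q = all holds (valuations (nub (atomsOf p ++ atomsOf q)))
  where holds v = not (eval v p) || eval v q
```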