Thank you! I’m interested in checking out earlier chapters to make sure I understand the notation, but here’s my current understanding:
There are 7 axioms that go into Joyce’s representation theorem, and none of them seem to put any constraints on the set of actions available to the agent. So we should be able to ask a Joyce-rational agent to choose a policy for a game.
My impression of the representation theorem is that a formula like $EU(a) := \sum_{j=1}^{N} P(a \hookrightarrow o_j; x) \cdot U(o_j)$ can represent a variety of decision theories, including dynamically inconsistent ones like CDT: they have a well-defined answer to “what do you think is the best policy?”, and it’s not necessarily consistent with their answer to “what are you actually going to do?”
So it seems like the axioms are consistent with policy optimization, and they’re also consistent with action optimization. We can ask a decision theory to optimize a policy using an analogous expression: $EU(\pi) := \sum_{j=1}^{N} P(\pi \hookrightarrow o_j; x) \cdot U(o_j)$.
It seems like we should be able to get a lot of leverage by imposing a consistency requirement that these two expressions line up: it shouldn’t matter whether we optimize over actions or over policies, the actions actually taken should be the same.
I don’t expect that fully specifies how to calculate the counterfactual data structures $P(a \hookrightarrow o_j; x)$ and $P(\pi \hookrightarrow o_j; x)$, even combined with Joyce’s 7 axioms. But those 7 axioms alone didn’t rule out dynamic or counterfactual inconsistency, and this requirement should at least narrow our search down to decision theories that are able to coordinate with themselves at other points in the game tree.
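To make the consistency requirement concrete, here’s a minimal sketch. Everything in it is hypothetical illustration, not Joyce’s formalism: the counterfactual structures $P(\cdot \hookrightarrow o_j; x)$ are stubbed in as plain dicts over outcomes, the toy game and its probabilities are invented, and a “policy” is just a tuple of actions (one per decision point, here a single one). The point is only to show the shape of the check: argmax over actions and argmax over policies should prescribe the same action.

```python
# Hypothetical sketch of the action/policy consistency check.
# Counterfactual distributions P(x ↪ o) are given as dicts {outcome: prob};
# the utilities and probabilities below are made-up toy numbers.

def expected_utility(counterfactual, utility):
    """EU(x) = sum_j P(x ↪ o_j) * U(o_j)."""
    return sum(p * utility[o] for o, p in counterfactual.items())

def best(counterfactuals, utility):
    """Pick the option with the highest expected utility."""
    return max(counterfactuals, key=lambda x: expected_utility(counterfactuals[x], utility))

utility = {"win": 1.0, "lose": 0.0}

# One-shot toy game: two available actions, each with its own
# (assumed) counterfactual outcome distribution P(a ↪ o).
action_cf = {
    "cooperate": {"win": 0.9, "lose": 0.1},
    "defect":    {"win": 0.3, "lose": 0.7},
}

# With a single decision point, a policy is a 1-tuple of actions,
# and P(π ↪ o) coincides with P(a ↪ o) for the action it prescribes.
policy_cf = {
    ("cooperate",): action_cf["cooperate"],
    ("defect",):    action_cf["defect"],
}

best_action = best(action_cf, utility)   # argmax_a EU(a)
best_policy = best(policy_cf, utility)   # argmax_π EU(π)

# The proposed consistency requirement: the optimal policy prescribes
# exactly the action that action-optimization selects.
consistent = best_policy[0] == best_action
print(best_action, best_policy, consistent)  # cooperate ('cooperate',) True
```

In a real multi-step game the interesting cases are the ones where the two argmaxes come apart, i.e. where a CDT-style $P(a \hookrightarrow o_j; x)$ at a later node disagrees with the policy chosen up front; the requirement above would rule those counterfactual structures out.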
Both the book & individual chapter (by DOI) are available in LG/SH. I’ve put up a copy.