Gurkenglas comments on Vanessa Kosoy’s Shortform

Gurkenglas 7 Dec 2019 13:03 UTC
LW: 1 AF: 1
AF
What do you mean by equivalent? The entire history doesn’t say what the opponent will do later or would do against other agents, and the source code may not allow you to prove what the agent does if it involves statements that are true but not provable.
- Vanessa Kosoy 7 Dec 2019 22:39 UTC
  LW: 2 AF: 1
  AF
  For a fixed policy, the history is the only thing you need to know in order to simulate the agent on a given round. In this sense, seeing the history is equivalent to seeing the source code.
  
  The claim is: In settings where the agent has unlimited memory and sees the entire history or source code, you can’t get good guarantees (as in the folk theorem for repeated games). On the other hand, in settings where the agent sees part of the history, or is constrained to have finite memory (possibly of size $O(\log \frac{1}{1 - \gamma})$ ?), you can (maybe?) prove convergence to Pareto efficient outcomes or some other strong desideratum that deserves to be called “superrationality”.
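
To make the "history is equivalent to source code" point concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not a formalization from the thread: a fixed policy is modeled as a deterministic function from the history to the next action, so anyone who knows the policy and the history can compute what the agent does on the current round. The names `tit_for_tat`, `grim_trigger`, `simulate_next_action`, `memory_bound`, and `with_bounded_memory` are hypothetical, and treating "memory size" as a number of remembered rounds (with an arbitrary constant `c`) is an assumption made only for this example.

```python
import math
from typing import Callable, List, Tuple

Action = str                               # "C" (cooperate) or "D" (defect)
History = List[Tuple[Action, Action]]      # (own action, opponent action) per round
Policy = Callable[[History], Action]       # a fixed, deterministic policy


def tit_for_tat(history: History) -> Action:
    """Cooperate first, then copy the opponent's previous action."""
    return "C" if not history else history[-1][1]


def grim_trigger(history: History) -> Action:
    """Cooperate until the opponent has defected once, then defect forever."""
    return "D" if any(opp == "D" for _, opp in history) else "C"


def simulate_next_action(policy: Policy, history: History) -> Action:
    """For a fixed policy, the history alone determines the next action."""
    return policy(history)


def memory_bound(gamma: float, c: float = 1.0) -> int:
    """Hypothetical memory budget ceil(c * log2(1 / (1 - gamma))).

    Assumption: the O(log(1 / (1 - gamma))) bound is read as a count of
    remembered rounds; the constant c is arbitrary.
    """
    return max(1, math.ceil(c * math.log2(1.0 / (1.0 - gamma))))


def with_bounded_memory(policy: Policy, k: int) -> Policy:
    """Wrap a policy so that it only sees the last k rounds (k >= 1)."""
    def bounded(history: History) -> Action:
        return policy(history[-k:])
    return bounded


history: History = [("C", "C"), ("C", "D"), ("C", "C")]
full = simulate_next_action(grim_trigger, history)
forgetful = simulate_next_action(with_bounded_memory(grim_trigger, 1), history)
```

With the full history, `grim_trigger` still punishes the defection from round two; restricted to the last round only, the same policy no longer sees it. This shows only how restricting the visible history changes what a policy can condition on. It does not demonstrate the conjectured convergence result.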