AlexMennen comments on A universal score for optimizers

AlexMennen 11 Jul 2018 4:11 UTC
LW: 6 AF: 3
AF
Some undesirable properties of C-score:
It depends on how the space of actions are represented. If a set of very similar actions that achieve the same utility for the agent are merged into one action, this will change the agent’s C-score.
It does not depend on the magnitudes of the agent’s preferences, only on their orderings. Compare 2 agents: the first has 3 available actions, which would give it utilities 0, .9, and 1, respectively, and it picks the action that would give it utility .9. The second has 3 available actions, which would give it utilities 0, .1, and 1, respectively, and it picks the action that would give it utility .1. Intuitively, the first agent is a more successful optimizer, but both agents have the same C-score.
What links here?
- Alignment Newsletter #15: 07/16/18 by Rohin Shah (16 Jul 2018 16:10 UTC; 42 points)
- levin 11 Jul 2018 17:37 UTC
  LW: 3 AF: 2
  AF Parent
  I agree with the first point, and I don’t have solid solutions to this. There’s also the fact that some games are easier to optimize than others (name a number game I described at the end vs. chess), and this complexity is impossible to capture while staying computation-agnostic. Maybe one can use the length of the shortest proof that taking action a leads to utility u(a) to account for these issues..
  The second point is more controversial, my intuition is that first agent is an equally good optimizer, even if it did better in terms of payoffs. Also, at least in the setting of deterministic games, utility functions are arbitrary up to encoding the same preference orderings (once randomness is introduced this stops being true)
  - AlexMennen 11 Jul 2018 18:34 UTC
    LW: 3 AF: 2
    AF Parent
    I think decision problems with incomplete information are a better model in which to measure optimization power than deterministic decision problems with complete information are. If the agent knows exactly what payoffs it would get from each action, it is hard to explain why it might not choose the optimal one. In the example I gave, the first agent could have mistakenly concluded that the .9-utility action was better than the 1-utility action while making only small errors in estimating the consequences of each of its actions, while the second agent would need to make large errors in estimating the consequences of its actions in order to think that the .1-utility action was better than the 1-utility action.