Koen.Holtman comments on Instrumental Convergence For Realistic Agent Objectives

Koen.Holtman 31 Jan 2022 19:32 UTC
LW: 1 AF: 1
AF
instrumental convergence basically disappears for agents with utility functions over action-observation histories.

Wait, I am puzzled. Have you just completely changed your mind about the preconditions needed to get a power-seeking agent? The way the above reads is: just add some observation of actions to your realistic utility function, and you instrumental convergence problem is solved.
1. u-AOH (utility functions over action-observation histories): No IC
2. u-OH (utility functions over observation histories): Strong IC
There are many utility functions in u-AOH that simply ignore the A part of the history, so these would then have Strong IC because they are u-OH functions. So are you are making a subtle mathematical point about how these will average away to zero (given various properties of infinite sets), or am I missing something?
- TurnTrout 31 Jan 2022 20:09 UTC
  LW: 2 AF: 2
  AF Parent
  I recommend reading the quoted post for clarification.