Relatedly [to power-seeking under the simplicity prior], Rohin Shah wrote:
> if you know that an agent is maximizing the expectation of an explicitly represented utility function, I would expect that to lead to goal-driven behavior most of the time, since the utility function must be relatively simple if it is explicitly represented, and simple utility functions seem particularly likely to lead to goal-directed behavior.
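To make the antecedent concrete, here is a minimal sketch (my own illustration, not anything from Rohin's comment or the post) of an agent maximizing the expectation of an explicitly represented utility function. The utility function is a short, explicit piece of code, and the agent's choices are organized entirely around whatever it scores highly, which is the flavor of goal-directedness the quote points at. All names and the toy world model are hypothetical.

```python
# Sketch of expected-utility maximization with an explicitly represented utility function.
# The world model and utility function below are purely illustrative.

def utility(outcome: str) -> float:
    # Explicitly represented, and therefore necessarily simple: "more coins is better".
    return float(outcome.count("coin"))

def expected_utility(action: str, model: dict) -> float:
    # model maps each action to a distribution over outcomes: [(probability, outcome), ...]
    return sum(p * utility(o) for p, o in model[action])

def choose_action(actions: list, model: dict) -> str:
    # Pick the action with the highest expected utility under the agent's model.
    return max(actions, key=lambda a: expected_utility(a, model))

# Toy world model (illustrative): "seek" reliably finds coins, "wander" mostly doesn't.
model = {
    "wander": [(0.9, "nothing"), (0.1, "coin")],
    "seek":   [(0.5, "coin coin"), (0.5, "coin")],
}
print(choose_action(["wander", "seek"], model))  # -> "seek"
```

The point of the sketch is only that once the utility function is written down explicitly, every downstream decision is an argmax over its expectation, so the behavior looks goal-directed by construction.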
Added to the post: