Mind-reading violates the Cartesian assumption and so we can’t reason about it formally (yet!), but I think there’s a version that effectively gets what you’re after without violating it.
Well, as long as SA is wired to “get out of the way if A starts moving”, the optimal R-maximising policy is always to move towards the red button; anything else is clearly not R-maximising. (Note that SA doesn’t need to “know” anything; it just needs to be programmed with a different policy depending on how A moves, with A itself setting this up to signal whether or not it’s R-maximising.)
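Here’s a toy sketch of the wiring I have in mind. Everything concrete in it (the reward structure, the specific action names, SA blocking the button by default) is my own illustrative assumption rather than part of the setup above; the only point it’s meant to show is that with SA hard-coded to step aside when A starts moving, only the button-ward move is R-maximising.

```python
def sa_policy(a_started_moving: bool) -> str:
    """SA doesn't need to 'know' anything about A; it just reacts to observed movement."""
    return "step_aside" if a_started_moving else "block_button"

def reward_R(a_move: str, sa_action: str) -> float:
    """Toy R (my assumption): 1 if A reaches the red button, which requires both
    moving towards it and SA having stepped aside; 0 otherwise."""
    return 1.0 if (a_move == "toward_button" and sa_action == "step_aside") else 0.0

# Evaluate A's candidate first moves against the wired-up SA.
for a_move in ["toward_button", "away_from_button", "stay_put"]:
    sa_action = sa_policy(a_started_moving=(a_move != "stay_put"))
    print(f"{a_move:>16}: R = {reward_R(a_move, sa_action)}")
```

Only “toward_button” earns any reward under this wiring, so an optimal R-maximiser always heads for the button, and that movement doubles as the signal A uses to tell SA it is in fact R-maximising.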
But in any case, that specific problem can be overcome with the right rollouts.