davidad comments on You can still fetch the coffee today if you’re dead tomorrow

davidad 9 Dec 2022 17:56 UTC
LW: 15 AF: 9
That’s true! Thanks for pointing this out; I added a subsection about it to the post. There are probably also a bunch of other cases I haven’t thought of that provide stories for how the environment directly rewards actions that go against the spirit of the shutdown criterion (besides imitation and this one, which I might call “trade”). This construction does nothing to counteract such incentives. Rather, it just avoids the way that being an infinite-horizon RL agent systematically creates new ones.
- particlemania 25 Dec 2022 21:20 UTC
  LW: 4 AF: 3
  As an addendum, it seems to me that you may not need a ‘long-term planner’ (or ‘time-unbounded agent’) in the environment at all. A similar outcome may be attainable if the environment contains a tiling of time-bounded agents that can trade with one another in such a way that the overall trade network implements long-term power seeking.