Charlie Steiner comments on Reward is not the optimization target

Charlie Steiner 1 Aug 2022 23:43 UTC
LW: 4 AF: 2
0
AF
But Agent 57 (or its successor) would go mash the button once it figured out how to do it. Kinda like the salt-starved rats from that one Steve Byrnes post. Put another way, my claim is that the architectural tweaks that let you beat Montezuma’s Revenge with RL are very similar to the architectural tweaks that make your agent act like it really is motivated by reward, across a broader domain.
- TurnTrout 7 Aug 2022 16:48 UTC
  LW: 2 AF: 2
  0
  AF Parent
  (Haven’t checked out Agent 57 in particular, but expect it to not have the “actually optimizes reward” property in the cases I argue against in the post.)