TurnTrout comments on Reward is not the optimization target

TurnTrout 7 Aug 2022 16:48 UTC
LW: 2 AF: 2
0
AF
(Haven’t checked out Agent 57 in particular, but expect it to not have the “actually optimizes reward” property in the cases I argue against in the post.)