John_Maxwell comments on Reward function learning: the learning process

John_Maxwell 1 May 2018 6:21 UTC
4 points
0

That’s precisely my definition for “unriggable” learning processes, in the next post:https://www.lesswrong.com/posts/upLot6eG8cbXdKiFS/reward-function-learning-the-learning-process

That’s a link to this post, right? ;)
- Stuart_Armstrong 1 May 2018 6:32 UTC
  2 points
  0
  Parent
  Ooops, yes! Sorry, for some reason, I thought this was the post on the value function.