Tao Lin comments on Seriously, what goes wrong with “reward the agent when it makes you smile”?

Tao Lin 15 Aug 2022 14:35 UTC
1 point
0
Also, I think if you trained something to predict text, then RL trained it on inclusive genetic fitness as a human (or human motivation signals), its learning would be mostly in the space of “select specific human / subdistribution of humans to imitate” rather than learning behaviors specific to the task, and then its generalization properties would depend more on those humans than on the specific training setup used