Wei Dai comments on Will humans build goal-directed agents?

Wei Dai 16 Feb 2019 4:56 UTC
LW: 8 AF: 3
AF

Can approval-directed agents be considered a form of imitation learning, and if not, are there any safety-relevant differences between imitation learning of (speeded-up) humans, and approval-directed agents?

I found an old comment from Paul that answers this:

I think that the only reason to be interested in approval-directed agents rather than straightforward imitation learners is that it may be harder to effectively imitate behavior than to solve the same task in a very different way.