Wei Dai comments on Will humans build goal-directed agents?

Wei Dai 5 Jan 2019 23:27 UTC
LW: 6 AF: 3
AF
That’s a good question. It looks like imitation learning actually covers a number of ML techniques (see this) none of which exactly matches approval-directed agents. But the category seems broad enough that I think approval-directed agents can be considered to be a form of imitation learning. In particular, IRL is considered a form of imitation learning and IRL would also be able to perform actions that the human would not have thought of doing themselves.
- Rohin Shah 6 Jan 2019 2:04 UTC
  LW: 3 AF: 1
  AF Parent
  ^ Yes to all of this.
  A little bit of nuance: IRL is considered to be a form of imitation learning because in many cases the inferred reward in IRL is only meant to reproduce the human’s performance and isn’t expected to generalize outside of the training distribution.
  There are versions of IRL which are meant to go beyond imitation. For example, adversarial IRL was trying to infer a reward that would generalize to new environments, in which case it would be doing something more than imitation.