Wei Dai comments on What is narrow value learning?

Wei Dai 10 Jan 2019 9:23 UTC
LW: 3 AF: 2
AF
To head off a possible confusion come tomorrow, it seems like your definition of “narrow value learning” is a bit different from Paul’s. You define it as learning to produce desired behavior in some domain, while Paul defined it as learning instrumental goals and values. I think this means that under your definition, behavioral cloning and approval-directed agents are subsets of narrow value learning, whereas under Paul’s definition they are disjoint from narrow value learning. Does this seem right to you, and if so was this overloading of the term intentional?
What links here?
- riceissa's comment on List of resolved confusions about IDA by Wei Dai (9 Oct 2019 5:16 UTC; 4 points)
- Rohin Shah 10 Jan 2019 17:00 UTC
  LW: 3 AF: 2
  AF Parent
  Hmm, I agree that Paul’s definition is different from mine, but it feels to me like they are both pointing at the same thing.
  I think this means that under your definition, behavioral cloning and approval-directed agents are subsets of narrow value learning
  That’s right.
  whereas under Paul’s definition they are disjoint from narrow value learning.
  I’m not sure. I would have included them, because sufficiently good behavioral cloning/approval-directed agents would need to learn instrumental goals and values in order to work effectively in a domain.
  was this overloading of the term intentional?
  It was intentional, in that I thought that these were different ways of pointing at the same thing.