To head off a possible confusion come tomorrow, it seems like your definition of “narrow value learning” is a bit different from Paul’s. You define it as learning to produce desired behavior in some domain, while Paul defined it as learning instrumental goals and values. I think this means that under your definition, behavioral cloning and approval-directed agents are subsets of narrow value learning, whereas under Paul’s definition they are disjoint from narrow value learning. Does this seem right to you, and if so was this overloading of the term intentional?
Hmm, I agree that Paul’s definition is different from mine, but it feels to me like they are both pointing at the same thing.
> I think this means that under your definition, behavioral cloning and approval-directed agents are subsets of narrow value learning
That’s right.
> whereas under Paul’s definition they are disjoint from narrow value learning.
I’m not sure. I would have included them, because sufficiently good behavioral cloning/approval-directed agents would need to learn instrumental goals and values in order to work effectively in a domain.
> was this overloading of the term intentional?
It was intentional, in that I thought that these were different ways of pointing at the same thing.