Yeah, I said “goal inference” instead of “value learning,” but I mean the same thing. The “ambitious” part is that we are trying to do much better than humans, which I was taking for granted in this post (it’s six months older than the ambitious vs. narrow value learning post).