I just meant that the usual RLHF setup is essentially RL in which the reward is provided by a learned model, but I agree that I was stretching the way the terminology is normally used.