Yes, I mean that those measurements don’t really speak directly to the question of whether you’d be safer using RLHF or imitation learning.
Yes, I mean that those measurements don’t really speak directly to the question of whether you’d be safer using RLHF or imitation learning.