Yeah, this is a good point. If I were designing a system where humans gave feedback on a CoT type system, I’d want to sometimes not show the human the full reasoning trace or the answer, and just have them rate whether the reasoning traces so far seem to be on the right track.
Yeah, this is a good point. If I were designing a system where humans gave feedback on a CoT type system, I’d want to sometimes not show the human the full reasoning trace or the answer, and just have them rate whether the reasoning traces so far seem to be on the right track.