Nathan Helm-Burger comments on RobertM’s Shortform

Nathan Helm-Burger 14 Sep 2024 17:07 UTC
2 points
Yeah, this is a good point. If I were designing a system where humans gave feedback on a CoT-type system, I’d sometimes want to show the human neither the full reasoning trace nor the answer, and just have them rate whether the reasoning so far seems to be on the right track.
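
A minimal sketch of that idea, under some loud assumptions: the reasoning trace is already split into a list of steps, and with some probability the rater sees only a prefix of it with the answer withheld. The names `RaterView`, `make_rater_view`, and `p_partial`, and the wording of the rater questions, are all hypothetical and only here for illustration.

```python
import random
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class RaterView:
    steps: List[str]        # reasoning steps actually shown to the rater
    answer: Optional[str]   # None when the final answer is withheld
    question: str           # what the rater is asked to judge


def make_rater_view(
    steps: List[str],
    answer: str,
    p_partial: float = 0.5,               # hypothetical knob: how often to truncate
    rng: Optional[random.Random] = None,
) -> RaterView:
    """Sometimes show only a prefix of the trace and hide the answer."""
    if rng is None:
        rng = random.Random()
    # Truncation needs at least two steps, so that something is left hidden.
    if len(steps) > 1 and rng.random() < p_partial:
        cut = rng.randint(1, len(steps) - 1)  # show 1..len-1 steps
        return RaterView(
            steps=steps[:cut],
            answer=None,
            question="Does the reasoning so far seem to be on the right track?",
        )
    # Otherwise show everything (assumed question wording).
    return RaterView(
        steps=list(steps),
        answer=answer,
        question="Are the reasoning and the final answer correct?",
    )
```

With `p_partial=1.0` every multi-step trace is truncated, and with `p_partial=0.0` the rater always sees the full trace and answer; anything in between mixes the two kinds of rating.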