RobertM comments on RobertM’s Shortform

RobertM 14 Sep 2024 21:31 UTC
8 points
0
Yeah, I meant terser compared to typical RLHD’d output from e.g. 4o. (I was looking at the traces they showed in https://openai.com/index/learning-to-reason-with-llms/).