Great critical awareness for evaluation research—I really endorse the CoT investigation. I’ve recently begun my own cross-model (Claude 3.7, GPT4.5, R1, Grok3) qualitative research project on sycophancy/people-pleasing behaviors, and the preliminary findings may point to an RLHF-induced response strategy, a meta-awareness of sorts regarding perceived evaluation contexts which seems to parallel your own findings! Following…
Great critical awareness for evaluation research—I really endorse the CoT investigation. I’ve recently begun my own cross-model (Claude 3.7, GPT4.5, R1, Grok3) qualitative research project on sycophancy/people-pleasing behaviors, and the preliminary findings may point to an RLHF-induced response strategy, a meta-awareness of sorts regarding perceived evaluation contexts which seems to parallel your own findings! Following…