Ethan Perez comments on OpenAI API base models are not sycophantic, at any size

Ethan Perez 29 Aug 2023 20:00 UTC
LW: 9 AF: 4
3
AF
Are you measuring the average probability the model places on the sycophantic answer, or the % of cases where the probability on the sycophantic answer exceeds the probability of the non-sycophantic answer? (I’d be interested to know both)