The “Still No Lie Detector for Language Models” paper is here: https://arxiv.org/pdf/2307.00175
The paper in the OP seems somewhat related to my post from earlier this year.