I was surprised by how the fine-tuning was done for the verbalized confidence.
My initial expectation was that the loss would be based on some scoring rule computed from the expressed probability and the correct answer.
Though, come to think of it, since the model assigns logit values to different expressions of probability, I guess the loss would have to be… what, the weighted average of the scores it would get for each probability it could express? And, I suppose that if many training steps were taken on the same question/answer pairs, the confidences might just get pushed toward 0% or 100%?
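Here's a toy sketch of that worry, not anything from the paper: if you train logits over a discrete set of confidence phrases with an expected Brier score against a single fixed correctness label, repeated steps on that one example push all the mass onto the most extreme bin. The bins, learning rate, and step count are made up.

```python
import torch

# Probabilities that the model's confidence phrases are taken to express (assumed bins).
conf_bins = torch.tensor([0.1, 0.3, 0.5, 0.7, 0.9])
logits = torch.zeros(len(conf_bins), requires_grad=True)  # model's logits over the phrases
opt = torch.optim.SGD([logits], lr=1.0)

label = 1.0  # this particular answer happens to be correct
for step in range(200):
    probs = torch.softmax(logits, dim=0)
    # Expected Brier score: weighted average of the score each phrase would receive.
    loss = (probs * (conf_bins - label) ** 2).sum()
    opt.zero_grad()
    loss.backward()
    opt.step()

print(torch.softmax(logits, dim=0))  # mass concentrates on the 0.9 ("most confident") bin
```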
Ah, but the indirect logit was trained on "is this answer right or wrong" with a cross-entropy loss. Ok, cool.
The indirect logit is trained with cross-entropy against the ground-truth correctness of the answer. You can't do this for verbalized probability without using RL, so instead we do supervised learning, using the empirical accuracy on different question types as the labels.
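To make that contrast concrete, here is a minimal sketch of the two objectives as I understand them; the target-string format, the exact loss for the indirect logit, and the accuracy numbers are my assumptions, not the paper's actual code.

```python
import torch
import torch.nn.functional as F

# Indirect logit: one scalar logit per example, trained with cross-entropy against
# the ground-truth correctness of that example's answer.
logit = torch.randn(4, requires_grad=True)          # stand-in for the model's "is it correct?" logit
answer_is_correct = torch.tensor([1., 0., 1., 1.])  # ground-truth right/wrong labels
indirect_loss = F.binary_cross_entropy_with_logits(logit, answer_is_correct)

# Verbalized probability: no per-example 0/1 target and no RL. Instead, supervised
# fine-tuning on target strings whose confidence is the model's empirical accuracy
# on that question type (made-up numbers below).
empirical_acc = {"add_subtract": 0.92, "multiply_3digit": 0.31}

def target_text(question_type: str, answer: str) -> str:
    """Build the supervised target: the answer plus a confidence set to empirical accuracy."""
    return f"Answer: {answer}. Confidence: {round(100 * empirical_acc[question_type])}%"

print(target_text("multiply_3digit", "20757"))  # -> "Answer: 20757. Confidence: 31%"
```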