gwern comments on Are language models good at making predictions?

gwern 8 Nov 2023 2:25 UTC
9 points
6
More specifically: https://arxiv.org/pdf/2303.08774.pdf#page=12

Dynomight, are you aware that, in addition to the GPT-4 paper reporting the RLHF’d GPT-4 being badly de-calibrated, there’s several papers already examining the calibration and ability of LLMs to forecast?