More specifically: https://arxiv.org/pdf/2303.08774.pdf#page=12
Dynomight, are you aware that, in addition to the GPT-4 paper reporting the RLHF’d GPT-4 being badly de-calibrated, there’s several papers already examining the calibration and ability of LLMs to forecast?
More specifically: https://arxiv.org/pdf/2303.08774.pdf#page=12
Dynomight, are you aware that, in addition to the GPT-4 paper reporting the RLHF’d GPT-4 being badly de-calibrated, there’s several papers already examining the calibration and ability of LLMs to forecast?