paulfchristiano comments on Thoughts on sharing information about language model capabilities

paulfchristiano 1 Aug 2023 18:37 UTC
LW: 21 AF: 10
17
AF
Note that Evals has just published a description of some of their work evaluating GPT-4 and Claude. Their publication does not include transcripts, the details of the LM agents they evaluated, or detailed qualitative discussion of the strengths and weaknesses of the agents they evaluated. I believe that eventually Evals should be considerably more liberal about sharing this kind of information; my post is explaining why I believe that.