I wonder whether you’d find a positive rather than negative correlation of token likelihood between davinci-002 and davinci-003 when looking at ranking logprob among all tokens rather than raw logprob which is pushed super low by the collapse?
I would guess it’s positive. I’ll check at some point and let you know.
I would guess it’s positive. I’ll check at some point and let you know.