Quintin Pope comments on LLMs are (mostly) not helped by filler tokens

Quintin Pope 11 Aug 2023 0:55 UTC
5 points
1
My assumption is that GPT-4 has a repetition penalty, so if you make it predict all the same phrase over and over again, it puts almost all its probability on a token that the repetition penalty prevents it from saying, with the leftover probability acting similarly to a max entropy distribution over the rest of the vocab.
- dr_s 13 Aug 2023 9:29 UTC
  1 point
  0
  Parent
  This happens with GPT-3.5 too, BTW.
- Kshitij Sachan 11 Aug 2023 5:46 UTC
  1 point
  0
  Parent
  By repetition penalty do you mean an explicit logit bias when sampling or internally it’s generalized to avoiding repeated tokens?