Herb Ingram comments on LLMs are (mostly) not helped by filler tokens

Herb Ingram 10 Aug 2023 22:52 UTC
2 points
0
There has been some talk recently about long “filler-like” input (e.g. “a a a a a [...]”) somewhat derailing GPT3&4, e.g. leading them to output what seems like random parts of it’s training data. Maybe this effect is worth mentioning and thinking about when trying to use filler input for other purposes.
What links here?
- gwern's comment on LLMs are (mostly) not helped by filler tokens by Kshitij Sachan (13 Aug 2023 1:39 UTC; 6 points)
- Quintin Pope 11 Aug 2023 0:55 UTC
  5 points
  1
  Parent
  My assumption is that GPT-4 has a repetition penalty, so if you make it predict all the same phrase over and over again, it puts almost all its probability on a token that the repetition penalty prevents it from saying, with the leftover probability acting similarly to a max entropy distribution over the rest of the vocab.
  - dr_s 13 Aug 2023 9:29 UTC
    1 point
    0
    Parent
    This happens with GPT-3.5 too, BTW.
  - Kshitij Sachan 11 Aug 2023 5:46 UTC
    1 point
    0
    Parent
    By repetition penalty do you mean an explicit logit bias when sampling or internally it’s generalized to avoiding repeated tokens?