I have personally observed completely untrained tokens in GPT-2. Specifically, I found that certain accented characters had very small, essentially random embeddings, so similar to one another that it looked as if none of them had received any training at all.
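A quick way to look for these is to rank tokens by the norm of their input embedding: tokens that never (or almost never) appeared in training should sit near their random initialization, with unusually small norms. Here is a minimal sketch using Hugging Face transformers; the library choice and the cutoff of 20 tokens are my assumptions for illustration, not the exact procedure I used.

```python
# Sketch: find candidate untrained tokens in GPT-2 by embedding norm.
import torch
from transformers import GPT2Tokenizer, GPT2Model

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2Model.from_pretrained("gpt2")

# Per-token L2 norms of the input embedding matrix, shape (50257, 768).
emb = model.get_input_embeddings().weight.detach()
norms = emb.norm(dim=1)

# Tokens with the smallest norms are candidates for having received
# little or no gradient signal during training.
smallest = torch.argsort(norms)[:20]  # top-20 is an arbitrary cutoff
for idx in smallest:
    i = idx.item()
    print(i, repr(tokenizer.decode([i])), f"norm={norms[i]:.4f}")

# Pairwise cosine similarities among the suspects; embeddings that were
# barely updated from initialization should cluster tightly here.
suspects = torch.nn.functional.normalize(emb[smallest], dim=1)
print(suspects @ suspects.T)
```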