Neel Nanda comments on SolidGoldMagikarp (plus, prompt generation)

Neel Nanda 6 Feb 2023 21:44 UTC
LW: 17 AF: 8
Oh wait, that FAQ actually has nothing to do with GPT-3. It’s about their embedding models, which map a sequence of tokens to a single vector, and they’re saying that those vectors are normalised. That has nothing to do with the map from tokens to residual stream vectors in GPT-3, even though that map also happens to be called an embedding.
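To make the distinction concrete, here is a toy sketch of the two different things that both get called an "embedding". Everything in it is an illustrative assumption: the sizes, the random matrix `W_E`, and especially `sequence_embedding`, which uses mean-pooling purely as a stand-in for whatever a real embedding model does internally. The only point is the shapes and the norms: a token embedding lookup gives one unnormalised vector per token, while an embedding model gives one unit-norm vector for the whole sequence.

```python
import math
import random

random.seed(0)
VOCAB_SIZE, D_MODEL = 5, 4  # toy sizes (assumption), nothing like GPT-3's

# Token embedding matrix: one row per token, with no constraint on row norms.
# Rows are scaled differently on purpose so their norms visibly differ.
W_E = [[random.gauss(0, 1) * (i + 1) for _ in range(D_MODEL)]
       for i in range(VOCAB_SIZE)]


def norm(v):
    """Euclidean (L2) norm of a vector."""
    return math.sqrt(sum(x * x for x in v))


def token_embeddings(token_ids):
    """Map each token to its own residual stream vector (a plain row lookup)."""
    return [W_E[t] for t in token_ids]


def sequence_embedding(token_ids):
    """Hypothetical stand-in for an embedding model:
    mean-pool the token vectors, then L2-normalise the result."""
    rows = token_embeddings(token_ids)
    pooled = [sum(col) / len(rows) for col in zip(*rows)]
    n = norm(pooled)
    return [x / n for x in pooled]


tokens = [0, 3, 4]
per_token = token_embeddings(tokens)    # one vector per token
whole_seq = sequence_embedding(tokens)  # one vector for the whole sequence

token_norms = [norm(v) for v in per_token]  # arbitrary values
seq_norm = norm(whole_seq)                  # 1.0 up to floating point error
```

A statement that the embedding model’s outputs are normalised only constrains `seq_norm` here. It says nothing about the row norms of `W_E`.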
- Jessica Rumbelow 7 Feb 2023 10:48 UTC
  1 point
  Aha!! Thanks Neel, that makes sense. I’ll update the post.