One reason the neuron is congruent with multiple variants of the same token may be that those token embeddings are similar (you can test this by checking their cosine similarities).
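A minimal sketch of that cosine-similarity check, using made-up 4-d vectors for illustration (for the real test you would load GPT-2's token embedding matrix, e.g. `GPT2Model.from_pretrained("gpt2").wte.weight` with Hugging Face transformers, assuming that API; the token labels in the comments are hypothetical):

```python
import numpy as np

def cosine_similarity_matrix(E):
    """Rows of E are token embeddings; returns the pairwise cosine-similarity matrix."""
    E_normed = E / np.linalg.norm(E, axis=1, keepdims=True)
    return E_normed @ E_normed.T

# Toy stand-ins for the embeddings of variants of the "same" token,
# plus one unrelated token. Real GPT-2 embeddings are 768-d (for small).
E = np.array([
    [1.0, 0.0, 0.5, 0.0],  # hypothetical " cat" (leading space)
    [0.9, 0.1, 0.5, 0.0],  # hypothetical "cat"
    [0.0, 1.0, 0.0, 0.5],  # hypothetical unrelated token
])

sims = cosine_similarity_matrix(E)
print(np.round(sims, 2))
```

If the token variants really do have similar embeddings, their off-diagonal entries should be close to 1 while entries against unrelated tokens stay much lower.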
Yup! I think that’d be quite interesting. Is there any work on characterizing the embedding space of GPT-2?
Adam Scherlis did some preliminary exploration here:
https://www.lesswrong.com/posts/BMghmAxYxeSdAteDc/an-exploration-of-gpt-2-s-embedding-weights
Here’s a more thorough investigation of the overall shape of said embeddings with interactive figures:
https://bert-vs-gpt2.dbvis.de/
There’s also a lot of academic work on the geometry of LM embeddings, e.g.:
https://openreview.net/forum?id=xYGNO86OWDH (BERT, ERNIE)
https://arxiv.org/abs/2209.02535 (GPT-2-medium)
(Plus a mountain more on earlier text/token embeddings like Word2Vec.)
https://www.lesswrong.com/posts/aPeJE8bSo6rAFoLqg/solidgoldmagikarp-plus-prompt-generation is also related to the embedding space.