Vivek Hebbar comments on Some Lessons Learned from Studying Indirect Object Identification in GPT-2 small

Vivek Hebbar 29 Oct 2022 13:14 UTC
LW: 1 AF: 1
0
AF
Exciting work! A minor question about the paper:
Does this mean that it writes a projection of S1′s positional embedding to S2′s residual stream? Or is it meant to say “writing to the position [residual stream] of [S2]”? Or something else?
- Alexandre Variengien 29 Oct 2022 18:44 UTC
  1 point
  0
  Parent
  Thanks for the feedback!
  Does this mean that it writes a projection of S1′s positional embedding to S2′s residual stream? Or is it meant to say “writing to the position [residual stream] of [S2]”? Or something else?
  Our current hypothesis is that they write some information about S1′s position (that we called the “position signal”, not as straightforward as a projection of its positional embedding) in the residual stream of S2. (See the paragraph “Locating the position signal.” in section 3.3). I hope this answer your questions.
  We currently think that the position signal is a relative pointer from S2 to S1, computed by the difference between the positions S2 and S1. However, our evidence for this claim is quite small (see the last paragraph of Appendix A).
  That’s definitely an exciting direction for future research!