I wonder if this is related to how GPT-J runs the attention and MLP sublayers in parallel, as opposed to sequentially?