ryan_greenblatt comments on Transformers Represent Belief State Geometry in their Residual Stream

ryan_greenblatt 18 Apr 2024 3:02 UTC
7 points
3
+1 to this comment, also I expect the importance of activations being optimized for predicting future tokens to increase considerably with scale. (E.g., GPT-4 level compute maybe just gets you a GPT-3 level model if you enforce no such optimization with a stop grad.)