dr_s comments on Transformers Represent Belief State Geometry in their Residual Stream

dr_s 20 Apr 2024 12:12 UTC
2 points
0
Given that the model eventually outputs the next token, shouldn’t the final embedding matrix be exactly your linear fit matrix multiplied by the probability of each state to output a given token? Could you use that?