I struggled with the notation on the figures; this comment tries to clarify a few points for anyone else who may be confused by it.
There are three main diagrams to pay attention to in order to understand what’s going on here:
The Z1R Process (this is a straightforward Hidden Markov Model diagram, look them up if it’s unclear).
The Z1R Mixed-State Presentation, representing the belief states of a model as it learns the underlying structure.
The Z1R Mixed-State Simplex. Importantly, unlike the other two, this isn’t just a graph: spatial placement is meaningful.
It’s better to ignore the numeric labels on the green nodes of the Mixed-State Presentation, at least until you’re clear about the rest. These labels are not uniquely determined, so the relationship between the subscripts can be very confusing. Just treat them as arbitrarily labeled distinct nodes whose only importance is the arrows leading in and out of them. Once you understand the rest you can go back and understand the subscripts if you want[1].
However, it’s important to note that the blue nodes are isomorphic to the Z1R Process diagram (n_101 = SR, n_11 = S0, n_00 = S1). Once the model has entered the correct blue node, it will thereafter be properly synchronized to the underlying process. The green nodes are transient belief states that the model passes through on its way to fully learning the process.
On the Mixed-State Simplex: I found the positions on the diagram quite confusing at first. The important thing to remember is that the three corners represent certainty that the underlying process is in the corresponding state (e.g. the top corner is n_00 = S1). So, for example, the position of n_0 indicates that the model is confident that the underlying process is definitely not in n_101 (SR), since it’s as far as possible from that corner, and that the model believes the process is more likely to be in n_00 (S1) than in n_11 (S0). Notice how this corresponds to the arrows leaving n_0 and their probabilities in the Mixed-State Presentation (67% chance of transitioning to n_101, 33% chance of transitioning to n_00).
Some more detail on n_0 if it isn’t clear after the previous paragraph:
Looking at the mixed-state presentation, if we’re in n_0, we’ve just seen a 0.
That means that there’s a 2⁄3 chance we’re currently in S1, and a 1⁄3 chance we’re currently in S0. And, of course, a 0 chance that we’re currently in SR.
Therefore n_0 should lie maximally far from the SR corner (n_101), and closer to the S1 corner (n_00) than to the S0 corner (n_11). Which is what we in fact see.
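The reasoning above can be checked numerically with a few lines of Bayesian filtering. This is my own sketch, not code from the post; the state ordering and matrix names are just my conventions for the Z1R process (S0 always emits 0 then moves to S1, S1 always emits 1 then moves to SR, SR emits a random bit then moves to S0):

```python
# Belief-state update for the Z1R process. State order: [S0, S1, SR].
# T[s][i][j] = P(emit symbol s AND move to state j | currently in state i).
T = {
    "0": [[0, 1, 0],     # S0 always emits 0, then moves to S1
          [0, 0, 0],     # S1 never emits 0
          [0.5, 0, 0]],  # SR emits 0 half the time, then moves to S0
    "1": [[0, 0, 0],     # S0 never emits 1
          [0, 0, 1],     # S1 always emits 1, then moves to SR
          [0.5, 0, 0]],  # SR emits 1 half the time, then moves to S0
}

def update(belief, symbol):
    """One step of the belief update: b' ∝ b @ T[symbol].
    Returns the new belief and P(symbol | old belief)."""
    M = T[symbol]
    unnorm = [sum(belief[i] * M[i][j] for i in range(3)) for j in range(3)]
    total = sum(unnorm)  # probability of seeing this symbol
    return [x / total for x in unnorm], total

# Start from the stationary distribution (uniform for Z1R) and observe "0".
b_start = [1/3, 1/3, 1/3]
n_0, _ = update(b_start, "0")
print(n_0)  # n_0: P(S0) = 1/3, P(S1) = 2/3, P(SR) = 0

# Next-symbol probabilities from n_0:
_, p1 = update(n_0, "1")  # 2/3: the transition to n_101 (certainly SR)
_, p0 = update(n_0, "0")  # 1/3: the transition to n_00 (certainly S1)
print(p0, p1)
```

This reproduces both the 2⁄3 / 1⁄3 belief split over S1 and S0, and the 67% / 33% arrow probabilities leaving n_0.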
@Adam Shai please correct me if I got any of that wrong!
If anyone else is still confused about how the diagrams work after reading this, please comment! I’m happy to help, and it’ll show me what parts of this explanation are inadequate.
Here are the details if you still want them after you’ve understood the rest. Each node label represents some path that could be taken to that node (& not to other nodes), but there can be multiple such paths. For example, n_11 could also be labeled as n_010, because those are both sequences that could have left us in that state. So as we move through the Mixed-State Presentation, we build up a history. If we start at n_s and follow the 1 path, we get to n_1. If we then follow the 0 path, we reach n_10. If we then follow the 0 path, the next node could be called n_100, reflecting the path we’ve taken. But in fact any path that ends with 00 will reach that node, so it’s just labeled n_00. So initially it seems as though we can just append the symbol emitted by whichever path we take, but often there’s some step where that breaks down and you get what initially seems like a totally random different label.
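The claim that several histories can name the same node can also be verified directly: two histories label the same node exactly when they induce the same belief state. A small sketch (again my own code, with my own state ordering for Z1R, not anything from the post):

```python
# Check that the histories "11" and "010" induce the same belief state,
# so n_11 and n_010 are two labels for the same node.
# State order: [S0, S1, SR]; T[s][i][j] = P(emit s, go to j | in i).
T = {"0": [[0, 1, 0], [0, 0, 0], [0.5, 0, 0]],
     "1": [[0, 0, 0], [0, 0, 1], [0.5, 0, 0]]}

def belief_after(history):
    """Filter the belief state forward from the stationary distribution."""
    b = [1/3, 1/3, 1/3]  # stationary distribution of Z1R
    for s in history:
        unnorm = [sum(b[i] * T[s][i][j] for i in range(3)) for j in range(3)]
        total = sum(unnorm)
        b = [x / total for x in unnorm]
    return b

print(belief_after("11"))   # certainty in S0
print(belief_after("010"))  # the same belief state, hence the same node
```

Both histories end in certainty that the process is in S0, which is why the node can equally be labeled n_11 or n_010.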
This all looks correct to me! Thanks for this.