Lucius Bushnaq comments on Toward A Mathematical Framework for Computation in Superposition

Lucius Bushnaq 9 Feb 2024 14:16 UTC
2 points
0
Thinking the example through a bit further: In a ReLU layer, features are all confined to the positive quadrant. So superposed features computed in a ReLU layer all have positive inner product. So if I send the output of one ReLU layer implementing $n^{2}$ AND gates in superposition directly to another ReLU layer implementing another $n^{2}$ ANDs on a subset of the outputs of that previous layer^[1], the assumption that input directions are equally likely to have positive and negative inner products is not satisfied.
Maybe you can fix this with bias setoffs somehow? Not sure at the moment. But as currently written, it doesn’t seem like I can use the outputs of one layer performing a subset of ANDs as the inputs of another layer performing another subset of ANDs.

EDIT: Talked it through with Jake. Bias setoff can help, but it currently looks to us like you still end up with AND gates that share a variable systematically having positive sign in their inner product. Which might make it difficult to implement a valid general recipe for multi-step computation if you try to work out the details.
1. ^
  A very central use case for a superposed boolean general computer. Otherwise you don’t actually get to implement any serial computation.