I’m confused by the read-in bound:
Sure, each neuron reads from $\frac{Tn\log M}{M}$ of the random subspaces. But in all but $k$ of those subspaces, the big network’s activations are smaller than $\delta$, right? So I was expecting a tighter bound, something like:
$$\epsilon^{l,\mathrm{in}}_t = O\!\left(w_a\sqrt{\frac{(k+T\delta)\,md}{MD}\log M}\right)$$
EDIT: Sorry, misunderstood your question at first.
Even if $\delta=0$, all those subspaces will have some nonzero overlap $O(1/\sqrt{D})$ with the activation vectors of the $k$ active subnets. The subspaces of the different small networks in the residual stream aren’t orthogonal.
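In case a concrete check helps, here’s a quick numpy sketch of that $O(1/\sqrt{D})$ overlap. It’s my own toy example rather than the post’s construction: I just treat the relevant subspace directions as independently drawn random directions in $\mathbb{R}^D$, and the dimensions are made up.

```python
import numpy as np

rng = np.random.default_rng(0)

# Independently drawn random unit vectors in R^D: the typical magnitude of
# their inner product scales like 1/sqrt(D) (up to a constant).
for D in [64, 256, 1024, 4096]:
    u = rng.standard_normal((1000, D))
    v = rng.standard_normal((1000, D))
    u /= np.linalg.norm(u, axis=1, keepdims=True)
    v /= np.linalg.norm(v, axis=1, keepdims=True)
    overlaps = np.abs(np.sum(u * v, axis=1))
    print(f"D={D:5d}  mean |<u,v>| = {overlaps.mean():.4f}   1/sqrt(D) = {1/np.sqrt(D):.4f}")
```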
Ah, I think I understand. Let me write it out to double-check, and in case it helps others.
Say $\delta=0$, for simplicity. Then $A^l = \sum_t E_t a^l_t$. This sum has $k$ nonzero terms.
In your construction, $W^{l,\mathrm{in}} = \sum_t V^l_t W^{l,\mathrm{in}}_t E_t^T$. Focussing on a single neuron, labelled by $i$, we have $(W^{l,\mathrm{in}})_i = \sum_t (V^l_t)_i W^{l,\mathrm{in}}_t E_t^T$. This sum has $\sim pT$ nonzero terms.
So the preactivation of an MLP hidden neuron in the big network is $p^l_i = \sum_{t,t'} (V^l_t)_i W^{l,\mathrm{in}}_t E_t^T E_{t'} a^l_{t'}$. This sum has $\sim kpT$ nonzero terms.
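Just to spell the split out (nothing new here, only rearranging the sum above, and assuming the embeddings are roughly orthonormal so that $E_t^T E_t \approx I$):

$$p^l_i = \underbrace{\sum_{t} (V^l_t)_i\, W^{l,\mathrm{in}}_t\, E_t^T E_t\, a^l_t}_{t=t'\text{ (signal)}} + \underbrace{\sum_{t \neq t'} (V^l_t)_i\, W^{l,\mathrm{in}}_t\, E_t^T E_{t'}\, a^l_{t'}}_{t \neq t'\text{ (noise)}},$$

where each entry of a cross-overlap $E_t^T E_{t'}$ is the $O(1/\sqrt{D})$ quantity from above.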
We only “want” the terms where $t=t'$; the rest (i.e. the majority) are noise. Each noise term in the sum is a random vector, so the $\sim kpT$ different noise terms are roughly orthogonal to one another, and the norm of the noise is $O(\sqrt{kpT})$ (times some other factors, but this captures the $T$-dependence, which is what I was confused about).
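For what it’s worth, a small numerical toy version of this argument reproduces the $\sqrt{kpT}$ scaling. This is my own simplification, not the post’s exact construction: I collapse $(V^l_t)_i W^{l,\mathrm{in}}_t$ into a single random read-in vector per subnet, use Gaussian embeddings, and make up all the sizes ($d$, $D$, $k$, $p$); the point is just that the ratio in the last column stays roughly constant as $T$ grows.

```python
import numpy as np

rng = np.random.default_rng(0)

d, D = 8, 256        # small / big residual-stream dims (made up)
k, p = 4, 0.05       # active subnets, prob. a given neuron reads a given subnet
n_trials = 50

for T in [100, 200, 400, 800]:
    noise_norms = []
    for _ in range(n_trials):
        # Random embedding of each small network's residual stream into R^D.
        # Entries ~ N(0, 1/D), so E_t^T E_t ~ I while E_t^T E_t' has O(1/sqrt(D)) entries.
        E = rng.normal(0.0, 1.0 / np.sqrt(D), size=(T, D, d))

        # Effective read-in of one big-MLP neuron per subnet it reads,
        # standing in for the row (V^l_t)_i W^{l,in}_t.
        reads = rng.random(T) < p
        w = rng.standard_normal((T, d)) * reads[:, None]

        # k active subnets; delta = 0, so all the others are exactly zero.
        a = np.zeros((T, d))
        active = rng.choice(T, size=k, replace=False)
        a[active] = rng.standard_normal((k, d))

        # Big residual stream and the neuron's preactivation.
        A = np.einsum('tDd,td->D', E, a)        # A^l = sum_t E_t a^l_t
        pre = np.einsum('td,tDd,D->', w, E, A)  # p^l_i = sum_{t,t'} w_t E_t^T E_t' a_t'

        # Signal = the t = t' terms; everything else is noise.
        signal = np.einsum('td,tDd,tDe,te->', w, E, E, a)
        noise_norms.append(abs(pre - signal))

    rms = np.sqrt(np.mean(np.square(noise_norms)))
    print(f"T={T:4d}  noise rms = {rms:.3f}   rms / sqrt(k*p*T) = {rms / np.sqrt(k * p * T):.3f}")
```

(In this toy setup the leftover constant comes out around $d/\sqrt{D}$, i.e. the “times some other factors” part above.)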
Yes, that’s right.