Senthooran Rajamanoharan comments on Improving Dictionary Learning with Gated Sparse Autoencoders

Senthooran Rajamanoharan 29 Apr 2024 8:51 UTC
LW: 3 AF: 2
0
AF
On $b_{mag}$ , it’s unclear what a “natural” choice would be for setting this parameter in order to simplify the architecture further. One natural reference point is to set it to $e^{r_{mag}} ⊙ b_{gate}$ , but this corresponds to getting rid of the discontinuity in the Jump ReLU (turning the magnitude encoder into a ReLU on multiplicatively rescaled gate encoder preactivations). Effectively (removing the now unnecessary auxiliary task), this would give results similar to the “baseline + rescale & shift” benchmark in section 5.2 of the paper, although probably worse, as we wouldn’t have the shift.
- leogao 1 May 2024 0:00 UTC
  LW: 2 AF: 1
  0
  AF Parent
  Makes sense that the shift would be helpful