Alex Flint comments on Polysemanticity and Capacity in Neural Networks

Alex Flint 11 Oct 2022 14:07 UTC
LW: 2 AF: 1
0
AF
Thanks for this.

we don’t have a generic technique to define capacity across different architectures and loss functions

Got it. I imagine that for some particular architectures, and given some particular network weights, you can numerically compute the marginal returns to capacity curves, but that it’s hard to express capacity analytically as a function of network weights since you really need to know what the particular features are in order to compute returns to capacity—is that correct?