Because some could be just calculations to get to a higher-level features (part of a circuit).
IMO, the intermediate steps should mostly be counted as features in their own right, but it’d depend on the circuit. The main reason I agree is that neurons probably still do some other stuff, eg memory management or signal boosting earlier directions in the residual stream.
Fair point, corrected.
IMO, the intermediate steps should mostly be counted as features in their own right, but it’d depend on the circuit. The main reason I agree is that neurons probably still do some other stuff, eg memory management or signal boosting earlier directions in the residual stream.