Yes, tuned lens is an excellent tool and generally superior to the original logit lens. In this particular case, however, I don’t think it would show very different results, and in any case the logit lens is only a small part of the analysis. That said, I think it would be interesting to have some kind of integration with TransformerLens that enabled training and using tuned lens as well.