Annah comments on Classifying representations of sparse autoencoders (SAEs)

Annah 17 Nov 2023 19:50 UTC
1 point
0
Yeah, this makes a ton of sense. Thx for taking the time to give it a closer look and also your detailed response :)
So then in order for the SAE to be useful I’d have to train it on a lot of sentiment data and then I could maybe discover some interpretable sentiment related features that could help me understand why a model thinks a review is positive/negative...