RSS

TomasD

Karma: 114

Toy Models of Fea­ture Ab­sorp­tion in SAEs

7 Oct 2024 9:56 UTC
49 points
8 comments10 min readLW link

[Paper] A is for Ab­sorp­tion: Study­ing Fea­ture Split­ting and Ab­sorp­tion in Sparse Autoencoders

25 Sep 2024 9:31 UTC
71 points
16 comments3 min readLW link
(arxiv.org)

To­masD’s Shortform

TomasD14 Mar 2024 15:03 UTC
1 point
0 comments1 min readLW link