Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
hrdkbhatnagar
Karma:
97
All
Posts
Comments
New
Top
Old
Toy Models of Feature Absorption in SAEs
chanind
,
hrdkbhatnagar
,
TomasD
and
Joseph Bloom
7 Oct 2024 9:56 UTC
46
points
7
comments
10
min read
LW
link
[Paper] A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders
chanind
,
TomasD
,
hrdkbhatnagar
and
Joseph Bloom
25 Sep 2024 9:31 UTC
69
points
15
comments
3
min read
LW
link
(arxiv.org)
Back to top