Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
lsgos
Karma:
227
All
Posts
Comments
New
Top
Old
Improving Dictionary Learning with Gated Sparse Autoencoders
Senthooran Rajamanoharan
,
Arthur Conmy
,
lsgos
,
Tom Lieberum
,
Vikrant Varma
,
János Kramár
,
Rohin Shah
and
Neel Nanda
25 Apr 2024 18:43 UTC
62
points
35
comments
1
min read
LW
link
(arxiv.org)
[Full Post] Progress Update #1 from the GDM Mech Interp Team
Neel Nanda
,
Arthur Conmy
,
lsgos
,
Senthooran Rajamanoharan
,
Tom Lieberum
,
János Kramár
and
Vikrant Varma
19 Apr 2024 19:06 UTC
71
points
8
comments
8
min read
LW
link
[Summary] Progress Update #1 from the GDM Mech Interp Team
Neel Nanda
,
Arthur Conmy
,
lsgos
,
Senthooran Rajamanoharan
,
Tom Lieberum
,
János Kramár
and
Vikrant Varma
19 Apr 2024 19:06 UTC
68
points
0
comments
3
min read
LW
link
Dropout can create a privileged basis in the ReLU output model.
lsgos
28 Apr 2023 1:59 UTC
24
points
3
comments
5
min read
LW
link
Back to top