hrdkbhatnagar
Karma: 127
Compositionality and Ambiguity: Latent Co-occurrence and Interpretable Subspaces
Matthew A. Clarke, hrdkbhatnagar and Joseph Bloom
Dec 20, 2024, 3:16 PM | 32 points | 0 comments | 37 min read
Toy Models of Feature Absorption in SAEs
chanind, hrdkbhatnagar, TomasD and Joseph Bloom
Oct 7, 2024, 9:56 AM | 49 points | 8 comments | 10 min read
[Paper] A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders
chanind, TomasD, hrdkbhatnagar and Joseph Bloom
Sep 25, 2024, 9:31 AM | 73 points | 16 comments | 3 min read | (arxiv.org)