Philippe Chlenski
Karma: 76
CS PhD student
Transcoders enable fine-grained interpretable circuit analysis for language models
Jacob Dunefsky, Philippe Chlenski and Neel Nanda
30 Apr 2024 17:58 UTC
71 points, 14 comments, 17 min read
Case Studies in Reverse-Engineering Sparse Autoencoder Features by Using MLP Linearization
Jacob Dunefsky, Philippe Chlenski, Senthooran Rajamanoharan and Neel Nanda
14 Jan 2024 2:06 UTC
23 points, 0 comments, 42 min read