Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Daniel Tan
(Daniel Tan)
Karma:
132
PhD student at UCL. Interested in mech interp.
All
Posts
Comments
New
Top
Old
The Residual Expansion: A Framework for thinking about Transformer Circuits
Daniel Tan
2 Aug 2024 11:04 UTC
16
points
13
comments
3
min read
LW
link
An Interpretability Illusion from Population Statistics in Causal Analysis
Daniel Tan
29 Jul 2024 14:50 UTC
9
points
3
comments
1
min read
LW
link
Daniel Tan’s Shortform
Daniel Tan
17 Jul 2024 6:38 UTC
2
points
32
comments
1
min read
LW
link
Mech Interp Lacks Good Paradigms
Daniel Tan
16 Jul 2024 15:47 UTC
33
points
0
comments
14
min read
LW
link
Activation Pattern SVD: A proposal for SAE Interpretability
Daniel Tan
28 Jun 2024 22:12 UTC
15
points
2
comments
2
min read
LW
link
Back to top