RSS

Daniel Tan(Daniel Tan)

Karma: 132

PhD student at UCL. Interested in mech interp.

The Resi­d­ual Ex­pan­sion: A Frame­work for think­ing about Trans­former Circuits

Daniel Tan2 Aug 2024 11:04 UTC
16 points
13 comments3 min readLW link

An In­ter­pretabil­ity Illu­sion from Pop­u­la­tion Statis­tics in Causal Analysis

Daniel Tan29 Jul 2024 14:50 UTC
9 points
3 comments1 min readLW link

Daniel Tan’s Shortform

Daniel Tan17 Jul 2024 6:38 UTC
2 points
32 comments1 min readLW link

Mech In­terp Lacks Good Paradigms

Daniel Tan16 Jul 2024 15:47 UTC
33 points
0 comments14 min readLW link

Ac­ti­va­tion Pat­tern SVD: A pro­posal for SAE Interpretability

Daniel Tan28 Jun 2024 22:12 UTC
15 points
2 comments2 min readLW link