Peter Lai

Karma: 35

Mechanistic Interpretability Enthusiast

Proof-of-Concept Debugger for a Small LLM

Peter Lai and StefanHex

Mar 17, 2025, 10:27 PM

27 points

0 comments 11 min read LW link

SAE regularization produces more interpretable models

Peter Lai and StefanHex

Jan 28, 2025, 8:02 PM

21 points

7 comments 4 min read LW link

Peter Lai’s Shortform

Peter Lai

Jan 25, 2025, 7:41 PM

3 points

0 comments LW link