Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Physics of Language models (part 2.1)
Nathan Helm-Burger
19 Sep 2024 16:48 UTC
9
points
1
comment
1
min read
LW
link
AI
Interpretability (ML & AI)
Post permalink
Link without comments
Link without top nav bars
Link without comments or top nav bars
Link post
This is perhaps the best interpretability work I’ve seen outside of Chris Olah’s team.
Nathan Helm-Burger
19 Sep 2024 16:48 UTC
9
points
1
comment
1
min read
LW
link
AI
Interpretability (ML & AI)
Post permalink
Link without comments
Link without top nav bars
Link without comments or top nav bars
Back to top
Physics of Language models (part 2.1)
Link post
This is perhaps the best interpretability work I’ve seen outside of Chris Olah’s team.