Physics of Language Models (Part 2.1)
Nathan Helm-Burger
19 Sep 2024 16:48 UTC
AI
Interpretability (ML & AI)
Link post
This is perhaps the best interpretability work I’ve seen outside of Chris Olah’s team.