Physics of Language Models (Part 2.1)

Nathan Helm-Burger, 19 Sep 2024 16:48 UTC

9 points

2 comments · 1 min read · LW link

AI Interpretability (ML & AI)

This is perhaps the best interpretability work I’ve seen outside of Chris Olah’s team.
