RSS

ntt123

Karma: 15

Logit Prisms: De­com­pos­ing Trans­former Out­puts for Mechanis­tic Interpretability

ntt123Jun 17, 2024, 11:46 AM
5 points
4 comments6 min readLW link
(neuralblog.github.io)

Ex­plor­ing Llama-3-8B MLP Neurons

ntt123Jun 9, 2024, 2:19 PM
10 points
0 comments4 min readLW link
(neuralblog.github.io)