Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
KevinRoWang
Karma:
231
https://kevinrowang.com/
All
Posts
Comments
New
Top
Old
Some Lessons Learned from Studying Indirect Object Identification in GPT-2 small
KevinRoWang
,
Alexandre Variengien
,
Arthur Conmy
,
Buck
and
jsteinhardt
28 Oct 2022 23:55 UTC
101
points
9
comments
9
min read
LW
link
2
reviews
(arxiv.org)
Gears-Level Mental Models of Transformer Interpretability
KevinRoWang
29 Mar 2022 20:09 UTC
72
points
4
comments
6
min read
LW
link
Lessons After a Couple Months of Trying to Do ML Research
KevinRoWang
22 Mar 2022 23:45 UTC
70
points
8
comments
6
min read
LW
link
Back to top