Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Arthur Conmy
Karma:
1,504
Intepretability
Views my own
All
Posts
Comments
New
Top
Old
Page
2
Mechanistically interpreting time in GPT-2 small
rgould
,
Elizabeth Ho
and
Arthur Conmy
Apr 16, 2023, 5:57 PM
68
points
6
comments
21
min read
LW
link
RLHF does not appear to differentially cause mode-collapse
Arthur Conmy
and
beren
Mar 20, 2023, 3:39 PM
95
points
9
comments
3
min read
LW
link
OpenAI introduce ChatGPT API at 1/10th the previous $/token
Arthur Conmy
Mar 1, 2023, 8:48 PM
28
points
4
comments
1
min read
LW
link
(openai.com)
Arthur Conmy’s Shortform
Arthur Conmy
Nov 1, 2022, 9:35 PM
2
points
1
comment
1
min read
LW
link
Some Lessons Learned from Studying Indirect Object Identification in GPT-2 small
KevinRoWang
,
Alexandre Variengien
,
Arthur Conmy
,
Buck
and
jsteinhardt
Oct 28, 2022, 11:55 PM
101
points
9
comments
9
min read
LW
link
2
reviews
(arxiv.org)
Previous
Back to top