RSS

Arthur Conmy

Karma: 1,504

Intepretability

Views my own

Mechanis­ti­cally in­ter­pret­ing time in GPT-2 small

Apr 16, 2023, 5:57 PM
68 points
6 comments21 min readLW link

RLHF does not ap­pear to differ­en­tially cause mode-collapse

Mar 20, 2023, 3:39 PM
95 points
9 comments3 min readLW link

OpenAI in­tro­duce ChatGPT API at 1/​10th the pre­vi­ous $/​token

Arthur ConmyMar 1, 2023, 8:48 PM
28 points
4 comments1 min readLW link
(openai.com)

Arthur Conmy’s Shortform

Arthur ConmyNov 1, 2022, 9:35 PM
2 points
1 comment1 min readLW link

Some Les­sons Learned from Study­ing Indi­rect Ob­ject Iden­ti­fi­ca­tion in GPT-2 small

Oct 28, 2022, 11:55 PM
101 points
9 comments9 min readLW link2 reviews
(arxiv.org)