RSS

Fabien Roger

Karma: 2,539

Fermi es­ti­ma­tion of the im­pact you might have work­ing on AI safety

Fabien Roger13 May 2022 17:49 UTC
6 points
0 comments1 min readLW link

The im­pact you might have work­ing on AI safety

Fabien Roger29 May 2022 16:31 UTC
5 points
1 comment4 min readLW link

How To Know What the AI Knows—An ELK Distillation

Fabien Roger4 Sep 2022 0:46 UTC
7 points
0 comments5 min readLW link

A Mys­tery About High Di­men­sional Con­cept Encoding

Fabien Roger3 Nov 2022 17:05 UTC
46 points
13 comments7 min readLW link

By De­fault, GPTs Think In Plain Sight

Fabien Roger19 Nov 2022 19:15 UTC
85 points
33 comments9 min readLW link

Ex­tract­ing and Eval­u­at­ing Causal Direc­tion in LLMs’ Activations

14 Dec 2022 14:33 UTC
29 points
5 comments11 min readLW link