RSS

WCargo

Karma: 162

(Léo Dana) French master student in applied Mathematics (probability & statistic), soon PhD in Mathematics in Paris

Vi­su­al­iz­ing small At­ten­tion-only Transformers

WCargoNov 19, 2024, 9:37 AM
4 points
0 comments8 min readLW link

Re­sults from the Tur­ing Sem­i­nar hackathon

Dec 7, 2023, 2:50 PM
29 points
1 comment6 min readLW link

On In­ter­pretabil­ity’s Robustness

WCargoOct 18, 2023, 1:18 PM
11 points
0 comments4 min readLW link

In­tro­duc­ing EffiS­ciences’ AI Safety Unit

Jun 30, 2023, 7:44 AM
68 points
0 comments12 min readLW link

Im­prove­ment on MIRI’s Corrigibility

Jun 9, 2023, 4:10 PM
54 points
8 comments13 min readLW link

A Cor­rigi­bil­ity Me­taphore—Big Gambles

WCargoMay 10, 2023, 6:13 PM
16 points
0 comments4 min readLW link