RSS

Guillaume Corlouer

Karma: 120

An in­for­ma­tion-the­o­retic study of ly­ing in LLMs

Aug 2, 2024, 10:06 AM
17 points
0 comments4 min readLW link

De­gen­era­cies are sticky for SGD

Jun 16, 2024, 9:19 PM
56 points
1 comment16 min readLW link

Un­der­stand­ing mesa-op­ti­miza­tion us­ing toy models

May 7, 2023, 5:00 PM
43 points
2 comments10 min readLW link

Me­tal­ign­ment: De­con­fus­ing metaethics for AI al­ign­ment.

Guillaume CorlouerAug 23, 2019, 10:25 AM
13 points
7 comments3 min readLW link