RSS

Charbel-Raphaël

Karma: 2,162

Charbel-Raphael Segerie

https://​​crsegerie.github.io/​​

Living in Paris

Im­prove­ment on MIRI’s Corrigibility

Jun 9, 2023, 4:10 PM
54 points
8 comments13 min readLW link

Thriv­ing in the Weird Times: Prepar­ing for the 100X Economy

May 8, 2023, 1:44 PM
23 points
16 comments2 min readLW link

Davi­dad’s Bold Plan for Align­ment: An In-Depth Explanation

Apr 19, 2023, 4:09 PM
168 points
40 comments21 min readLW link2 reviews

New Hackathon: Ro­bust­ness to dis­tri­bu­tion changes and ambiguity

Charbel-RaphaëlJan 31, 2023, 12:50 PM
12 points
3 comments1 min readLW link

Com­pendium of prob­lems with RLHF

Charbel-RaphaëlJan 29, 2023, 11:40 AM
120 points
16 comments10 min readLW link

[Question] Don’t you think RLHF solves outer al­ign­ment?

Charbel-RaphaëlNov 4, 2022, 12:36 AM
9 points
23 comments1 min readLW link

Easy fix­ing Voting

Charbel-RaphaëlOct 2, 2022, 5:03 PM
12 points
2 comments1 min readLW link

Open ap­pli­ca­tion to be­come an AI safety pro­ject mentor

Charbel-RaphaëlSep 29, 2022, 11:27 AM
10 points
0 comments1 min readLW link
(docs.google.com)

[Question] Help me find a good Hackathon sub­ject

Charbel-RaphaëlSep 4, 2022, 8:40 AM
6 points
18 comments1 min readLW link

[Question] How to im­press stu­dents with re­cent ad­vances in ML?

Charbel-RaphaëlJul 14, 2022, 12:03 AM
12 points
2 comments1 min readLW link

[Question] Is it de­sir­able for the first AGI to be con­scious?

Charbel-RaphaëlMay 1, 2022, 9:29 PM
5 points
12 comments1 min readLW link