RSS

Raymond D

Karma: 1,063

Grad­ual Disem­pow­er­ment: Sys­temic Ex­is­ten­tial Risks from In­cre­men­tal AI Development

Jan 30, 2025, 5:03 PM
159 points
52 comments2 min readLW link
(gradual-disempowerment.ai)

What does suc­cess look like?

Raymond DJan 23, 2025, 5:48 PM
11 points
0 comments3 min readLW link

The Choice Transition

Nov 18, 2024, 12:30 PM
50 points
4 comments15 min readLW link
(strangecities.substack.com)

De­com­pos­ing Agency — ca­pa­bil­ities with­out desires

Jul 11, 2024, 9:38 AM
146 points
32 comments12 min readLW link
(strangecities.substack.com)

ChatGPT can learn in­di­rect control

Raymond DMar 21, 2024, 9:11 PM
213 points
27 comments1 min readLW link

Pre­dic­tive model agents are sort of corrigible

Raymond DJan 5, 2024, 2:05 PM
35 points
6 comments3 min readLW link

Pick­ing Men­tors For Re­search Programmes

Raymond DNov 10, 2023, 1:01 PM
105 points
8 comments4 min readLW link