RSS

Max Harms

Karma: 435

Also known as Raelifin: https://​​www.lesswrong.com/​​users/​​raelifin

In­stru­men­tal vs Ter­mi­nal Desiderata

Max Harms26 Jun 2024 20:57 UTC
21 points
0 comments3 min readLW link

Max Harms’s Shortform

Max Harms13 Jun 2024 18:19 UTC
3 points
1 comment1 min readLW link

5. Open Cor­rigi­bil­ity Questions

Max Harms10 Jun 2024 14:09 UTC
22 points
0 comments7 min readLW link

4. Ex­ist­ing Writ­ing on Corrigibility

Max Harms10 Jun 2024 14:08 UTC
47 points
15 comments106 min readLW link

3b. For­mal (Faux) Corrigibility

Max Harms9 Jun 2024 17:18 UTC
19 points
13 comments17 min readLW link

3a. Towards For­mal Corrigibility

Max Harms9 Jun 2024 16:53 UTC
22 points
2 comments19 min readLW link

2. Cor­rigi­bil­ity Intuition

Max Harms8 Jun 2024 15:52 UTC
65 points
10 comments33 min readLW link

1. The CAST Strategy

Max Harms7 Jun 2024 22:29 UTC
46 points
19 comments38 min readLW link

0. CAST: Cor­rigi­bil­ity as Sin­gu­lar Target

Max Harms7 Jun 2024 22:29 UTC
137 points
12 comments8 min readLW link