Internal Alignment (Human)

Tag. Last edit: Jan 13, 2021, 10:54 PM by plex

Internal Alignment is a broadly desirable state. By default, humans sometimes experience internal conflict. You might frame that as conflict between subagents or subprocesses within the person, or instead as a single agent making complicated decisions. The “internal alignment” hypothesis is that you can become much more productive, happier, and more fulfilled by getting into alignment with yourself.

Notes on Integrity

David Gross, Dec 3, 2020, 11:42 PM
18 points
1 comment, 8 min read, LW link

The shard theory of human values

Sep 4, 2022, 4:28 AM
255 points
67 comments, 24 min read, LW link, 2 reviews

If you are too stressed, walk away from the front lines

Neil, Jun 12, 2023, 2:26 PM
44 points
14 comments, 5 min read, LW link

Non-Coercive Perfectionism

Matt Goldenberg, Jan 26, 2021, 4:53 PM
25 points
25 comments, 3 min read, LW link

Announcing the Alignment of Complex Systems Research Group

Jun 4, 2022, 4:10 AM
91 points
20 comments, 5 min read, LW link

Trust develops gradually via making bids and setting boundaries

Richard_Ngo, May 19, 2023, 10:16 PM
133 points
12 comments, 4 min read, LW link

Internal communication framework

Nov 15, 2022, 12:41 PM
38 points
13 comments, 12 min read, LW link

Please don’t throw your mind away

TsviBT, Feb 15, 2023, 9:41 PM
359 points
48 comments, 18 min read, LW link, 1 review

Artificial Moral Advisors: A New Perspective from Moral Psychology

David Gross, Aug 28, 2022, 4:37 PM
25 points
1 comment, 1 min read, LW link
(dl.acm.org)

Tidying One’s Room

Zvi, Aug 16, 2018, 1:50 PM
36 points
3 comments, 4 min read, LW link
(thezvi.wordpress.com)

My Model Of EA Burnout

LoganStrohl, Jan 25, 2023, 5:52 PM
256 points
50 comments, 5 min read, LW link, 1 review

Integrating disagreeing subagents

Kaj_Sotala, May 14, 2019, 2:06 PM
147 points
15 comments, 21 min read, LW link