Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Modeling People
Tag
Relevant
New
Old
Notes on Empathy
David Gross
May 3, 2022, 4:06 AM
27
points
2
comments
57
min read
LW
link
Utilitarianism and the replaceability of desires and attachments
MichaelStJules
Jul 27, 2024, 1:57 AM
5
points
2
comments
1
min read
LW
link
Really radical empathy
MichaelStJules
Jan 6, 2025, 5:46 PM
19
points
0
comments
1
min read
LW
link
Beyond algorithmic equivalence: self-modelling
Stuart_Armstrong
Feb 28, 2018, 4:55 PM
10
points
3
comments
1
min read
LW
link
How to Write Like Kaj Sotala
Matt Goldenberg
Jan 7, 2021, 7:33 PM
79
points
4
comments
5
min read
LW
link
Competent Preferences
Charlie Steiner
Sep 2, 2021, 2:26 PM
30
points
2
comments
6
min read
LW
link
Ways of being with you
KatjaGrace
Feb 5, 2021, 7:00 AM
7
points
2
comments
1
min read
LW
link
(worldspiritsockpuppet.com)
[Question]
Help me solve this problem: The basilisk isn’t real, but people are
canary_itm
Nov 26, 2023, 5:44 PM
−19
points
4
comments
1
min read
LW
link
Life at Three Tails of the Bell Curve
lsusr
Jun 27, 2020, 8:49 AM
65
points
10
comments
4
min read
LW
link
In praise of fake frameworks
Valentine
Jul 11, 2017, 2:12 AM
117
points
15
comments
7
min read
LW
link
How to understand people better
pwno
Oct 14, 2011, 7:53 PM
101
points
163
comments
5
min read
LW
link
Physical alignment—do you have it? Take a minute & check.
leggi
Feb 5, 2020, 4:02 AM
4
points
4
comments
1
min read
LW
link
Approving reinforces low-effort behaviors
Scott Alexander
Jul 17, 2011, 8:43 PM
164
points
25
comments
4
min read
LW
link
Getting rational now or later: navigating procrastination and time-inconsistent preferences for new rationalists
milo_thoughts
Feb 26, 2024, 7:38 PM
1
point
0
comments
8
min read
LW
link
Sequential Organization of Thinking: “Six Thinking Hats”
JustinShovelain
Mar 18, 2010, 5:22 AM
30
points
14
comments
3
min read
LW
link
Applied Bayes’ Theorem: Reading People
Kaj_Sotala
Jun 30, 2010, 5:21 PM
37
points
26
comments
8
min read
LW
link
[untitled post]
verwindung
Sep 14, 2023, 4:22 PM
1
point
0
comments
1
min read
LW
link
Bounded distrust or Bounded trust?
M. Y. Zuo
Oct 15, 2022, 4:41 PM
2
points
12
comments
3
min read
LW
link
Maps of Maps, and Empty Expectations
Nora_Ammann
May 3, 2021, 9:59 AM
29
points
5
comments
14
min read
LW
link
Why modelling multi-objective homeostasis is essential for AI alignment (and how it helps with AI safety as well)
Roland Pihlakas
Jan 12, 2025, 3:37 AM
46
points
7
comments
10
min read
LW
link
Mistakes as agency
pchvykov
Jul 25, 2022, 4:17 PM
12
points
8
comments
4
min read
LW
link
A gentle primer on caring, including in strange senses, with applications
Kaarel
Aug 30, 2022, 8:05 AM
10
points
4
comments
18
min read
LW
link
Some implications of radical empathy
MichaelStJules
Jan 7, 2025, 4:10 PM
3
points
0
comments
1
min read
LW
link
Building AI safety benchmark environments on themes of universal human values
Roland Pihlakas
Jan 3, 2025, 4:24 AM
18
points
3
comments
8
min read
LW
link
(docs.google.com)
No comments.
Back to top