Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Coherence Arguments
Tag
Relevant
New
Old
Coherent decisions imply consistent utilities
Eliezer Yudkowsky
May 12, 2019, 9:33 PM
149
points
82
comments
26
min read
LW
link
3
reviews
There are no coherence theorems
Dan H
and
EJT
Feb 20, 2023, 9:25 PM
149
points
130
comments
19
min read
LW
link
1
review
A Simple Toy Coherence Theorem
johnswentworth
and
David Lorell
Aug 2, 2024, 5:47 PM
74
points
22
comments
7
min read
LW
link
[Question]
Why The Focus on Expected Utility Maximisers?
DragonGod
Dec 27, 2022, 3:49 PM
118
points
84
comments
3
min read
LW
link
Coherence arguments do not entail goal-directed behavior
Rohin Shah
Dec 3, 2018, 3:26 AM
134
points
69
comments
7
min read
LW
link
3
reviews
[Question]
What do coherence arguments actually prove about agentic behavior?
sunwillrise
Jun 1, 2024, 9:37 AM
123
points
37
comments
6
min read
LW
link
Coherence of Caches and Agents
johnswentworth
Apr 1, 2024, 11:04 PM
77
points
9
comments
11
min read
LW
link
Counting-down vs. counting-up coherence
TsviBT
Feb 27, 2023, 2:59 PM
29
points
4
comments
13
min read
LW
link
Contra “Strong Coherence”
DragonGod
Mar 4, 2023, 8:05 PM
39
points
24
comments
1
min read
LW
link
[Question]
Is “Strong Coherence” Anti-Natural?
DragonGod
Apr 11, 2023, 6:22 AM
23
points
25
comments
2
min read
LW
link
Coherence arguments imply a force for goal-directed behavior
KatjaGrace
Mar 26, 2021, 4:10 PM
91
points
25
comments
11
min read
LW
link
1
review
(aiimpacts.org)
When Most VNM-Coherent Preference Orderings Have Convergent Instrumental Incentives
TurnTrout
Aug 9, 2021, 5:22 PM
53
points
4
comments
5
min read
LW
link
The hot mess theory of AI misalignment: More intelligent agents behave less coherently
Jonathan Yan
Mar 10, 2023, 12:20 AM
48
points
21
comments
1
min read
LW
link
(sohl-dickstein.github.io)
Coherent behaviour in the real world is an incoherent concept
Richard_Ngo
Feb 11, 2019, 5:00 PM
51
points
17
comments
9
min read
LW
link
Measuring Coherence and Goal-Directedness in RL Policies
dx26
Apr 22, 2024, 6:26 PM
10
points
0
comments
7
min read
LW
link
The Impossibility of a Rational Intelligence Optimizer
Nicolas Villarreal
Jun 6, 2024, 4:14 PM
−9
points
5
comments
14
min read
LW
link
[Question]
Money Pump Arguments assume Memoryless Agents. Isn’t this Unrealistic?
Dalcy
Aug 16, 2024, 4:16 AM
26
points
6
comments
1
min read
LW
link
It Can’t Be Mesa-Optimizers All The Way Down (Or Else It Can’t Be Long-Term Supercoherence?)
Austin Witte
Mar 31, 2023, 7:21 AM
20
points
5
comments
4
min read
LW
link
Let’s look for coherence theorems
Valdes
May 7, 2023, 2:45 PM
25
points
18
comments
6
min read
LW
link
[Linkpost] Will AI avoid exploitation?
cdkg
Aug 6, 2023, 2:28 PM
22
points
1
comment
1
min read
LW
link
Do incoherent entities have stronger reason to become more coherent than less?
KatjaGrace
Jun 30, 2021, 5:50 AM
46
points
5
comments
4
min read
LW
link
(worldspiritsockpuppet.com)
Three ways that “Sufficiently optimized agents appear coherent” can be false
Wei Dai
Mar 5, 2019, 9:52 PM
65
points
3
comments
3
min read
LW
link
Comment on Coherence arguments do not imply goal directed behavior
Ronny Fernandez
Dec 6, 2019, 9:30 AM
30
points
8
comments
5
min read
LW
link
[Question]
Is there a “coherent decisions imply consistent utilities”-style argument for non-lexicographic preferences?
Tetraspace
Jun 29, 2021, 7:14 PM
4
points
20
comments
1
min read
LW
link
Deriving Conditional Expected Utility from Pareto-Efficient Decisions
Thomas Kwa
May 5, 2022, 3:21 AM
24
points
1
comment
6
min read
LW
link
The “Measuring Stick of Utility” Problem
johnswentworth
May 25, 2022, 4:17 PM
74
points
25
comments
3
min read
LW
link
[Request for Distillation] Coherence of Distributed Decisions With Different Inputs Implies Conditioning
johnswentworth
Apr 25, 2022, 5:01 PM
22
points
14
comments
2
min read
LW
link
No comments.
Back to top
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel