Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Power Buys You Distance From The Crime
Elizabeth
2 Aug 2019 20:50 UTC
209
points
75
comments
7
min read
LW
link
1
review
(acesounderglass.com)
Why Subagents?
johnswentworth
1 Aug 2019 22:17 UTC
174
points
48
comments
7
min read
LW
link
1
review
The Commitment Races problem
Daniel Kokotajlo
23 Aug 2019 1:58 UTC
152
points
56
comments
5
min read
LW
link
Soft takeoff can still lead to decisive strategic advantage
Daniel Kokotajlo
23 Aug 2019 16:39 UTC
122
points
47
comments
8
min read
LW
link
4
reviews
Subagents, trauma and rationality
Kaj_Sotala
14 Aug 2019 13:14 UTC
111
points
4
comments
19
min read
LW
link
Trauma, Meditation, and a Cool Scar
Logan Riggs
6 Aug 2019 16:17 UTC
102
points
17
comments
5
min read
LW
link
1
review
[Question]
Can we really prevent all warming for less than 10B$ with the mostly side-effect free geoengineering technique of Marine Cloud Brightening?
mako yass
5 Aug 2019 0:12 UTC
94
points
55
comments
2
min read
LW
link
Partial summary of debate with Benquo and Jessicata [pt 1]
Raemon
14 Aug 2019 20:02 UTC
89
points
63
comments
22
min read
LW
link
3
reviews
Subagents, neural Turing machines, thought selection, and blindspots
Kaj_Sotala
6 Aug 2019 21:15 UTC
87
points
3
comments
12
min read
LW
link
Troll Bridge
abramdemski
23 Aug 2019 18:36 UTC
86
points
59
comments
12
min read
LW
link
2-D Robustness
Vlad Mikulik
30 Aug 2019 20:27 UTC
85
points
8
comments
2
min read
LW
link
Clarifying some key hypotheses in AI alignment
Ben Cottier
and
Rohin Shah
15 Aug 2019 21:29 UTC
79
points
12
comments
9
min read
LW
link
Problems in AI Alignment that philosophers could potentially contribute to
Wei Dai
17 Aug 2019 17:38 UTC
78
points
14
comments
2
min read
LW
link
Markets are Universal for Logical Induction
johnswentworth
22 Aug 2019 6:44 UTC
75
points
2
comments
5
min read
LW
link
Classifying specification problems as variants of Goodhart’s Law
Vika
19 Aug 2019 20:40 UTC
72
points
5
comments
5
min read
LW
link
1
review
Six AI Risk/Strategy Ideas
Wei Dai
27 Aug 2019 0:40 UTC
69
points
17
comments
4
min read
LW
link
1
review
[Question]
Does Agent-like Behavior Imply Agent-like Architecture?
Scott Garrabrant
23 Aug 2019 2:01 UTC
66
points
8
comments
1
min read
LW
link
Response to Glen Weyl on Technocracy and the Rationalist Community
John_Maxwell
22 Aug 2019 23:14 UTC
66
points
9
comments
10
min read
LW
link
[Question]
Why so much variance in human intelligence?
Ben Pace
22 Aug 2019 22:36 UTC
65
points
28
comments
4
min read
LW
link
Book Review: Secular Cycles
Scott Alexander
13 Aug 2019 4:10 UTC
62
points
10
comments
16
min read
LW
link
1
review
(slatestarcodex.com)
Dual Wielding
Zvi
27 Aug 2019 14:10 UTC
60
points
23
comments
2
min read
LW
link
3
reviews
(thezvi.wordpress.com)
How to Make Billions of Dollars Reducing Loneliness
John_Maxwell
30 Aug 2019 17:30 UTC
60
points
32
comments
7
min read
LW
link
Schelling Categories, and Simple Membership Tests
Zack_M_Davis
26 Aug 2019 2:43 UTC
59
points
10
comments
8
min read
LW
link
Tabooing ‘Agent’ for Prosaic Alignment
Hjalmar_Wijk
23 Aug 2019 2:55 UTC
57
points
10
comments
6
min read
LW
link
Actually updating
SaraHax
23 Aug 2019 17:46 UTC
56
points
10
comments
4
min read
LW
link
Intentional Bucket Errors
Scott Garrabrant
22 Aug 2019 20:02 UTC
55
points
6
comments
3
min read
LW
link
Permissions in Governance
sarahconstantin
2 Aug 2019 19:50 UTC
53
points
12
comments
8
min read
LW
link
(srconstantin.wordpress.com)
A Personal Rationality Wishlist
DanielFilan
27 Aug 2019 3:40 UTC
53
points
54
comments
4
min read
LW
link
(danielfilan.com)
Computational Model: Causal Diagrams with Symmetry
johnswentworth
22 Aug 2019 17:54 UTC
53
points
29
comments
4
min read
LW
link
AI Forecasting Dictionary (Forecasting infrastructure, part 1)
jacobjacob
and
bgold
8 Aug 2019 16:10 UTC
50
points
0
comments
5
min read
LW
link
Vaniver’s View on Factored Cognition
Vaniver
23 Aug 2019 2:54 UTC
48
points
4
comments
8
min read
LW
link
Status 451 on Diagnosis: Russell Aphasia
Zack_M_Davis
6 Aug 2019 4:43 UTC
48
points
1
comment
1
min read
LW
link
(status451.com)
September Bragging Thread
Raemon
30 Aug 2019 21:58 UTC
47
points
12
comments
1
min read
LW
link
Towards a mechanistic understanding of corrigibility
evhub
22 Aug 2019 23:20 UTC
47
points
26
comments
4
min read
LW
link
[Question]
How Can People Evaluate Complex Questions Consistently?
Elizabeth
26 Aug 2019 20:33 UTC
46
points
12
comments
1
min read
LW
link
[Link] Book Review: Reframing Superintelligence (SSC)
ioannes
28 Aug 2019 22:57 UTC
46
points
9
comments
2
min read
LW
link
Zeno walks into a bar
lsusr
4 Aug 2019 7:00 UTC
44
points
4
comments
2
min read
LW
link
New paper: Corrigibility with Utility Preservation
Koen.Holtman
6 Aug 2019 19:04 UTC
44
points
11
comments
2
min read
LW
link
Embedded Agency via Abstraction
johnswentworth
26 Aug 2019 23:03 UTC
42
points
20
comments
11
min read
LW
link
My recommendations for gratitude exercises
MaxCarpendale
5 Aug 2019 19:04 UTC
40
points
3
comments
5
min read
LW
link
The Missing Math of Map-Making
johnswentworth
28 Aug 2019 21:18 UTC
40
points
8
comments
2
min read
LW
link
Cephaloponderings
Jacob Falkovich
4 Aug 2019 16:45 UTC
39
points
4
comments
7
min read
LW
link
Call for contributors to the Alignment Newsletter
Rohin Shah
21 Aug 2019 18:21 UTC
39
points
0
comments
4
min read
LW
link
LW Team Updates—September 2019
Ruby
29 Aug 2019 22:12 UTC
39
points
13
comments
2
min read
LW
link
Epistemic Spot Check: The Fate of Rome (Kyle Harper)
Elizabeth
24 Aug 2019 21:40 UTC
39
points
3
comments
5
min read
LW
link
(acesounderglass.com)
Unstriving
Jacob Falkovich
19 Aug 2019 14:31 UTC
38
points
7
comments
6
min read
LW
link
Diana Fleischman and Geoffrey Miller—Audience Q&A
Jacob Falkovich
10 Aug 2019 22:37 UTC
38
points
6
comments
9
min read
LW
link
Optimization Provenance
Adele Lopez
23 Aug 2019 20:08 UTC
38
points
5
comments
5
min read
LW
link
Mistake Versus Conflict Theory of Against Billionaire Philanthropy
Zvi
1 Aug 2019 13:10 UTC
36
points
34
comments
3
min read
LW
link
(thezvi.wordpress.com)
Verification and Transparency
DanielFilan
8 Aug 2019 1:50 UTC
35
points
6
comments
2
min read
LW
link
(danielfilan.com)
Back to top
Next