Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
Page
1
AGI Ruin: A List of Lethalities
Eliezer Yudkowsky
Jun 5, 2022, 10:05 PM
929
points
708
comments
30
min read
LW
link
3
reviews
Where I agree and disagree with Eliezer
paulfchristiano
Jun 19, 2022, 7:15 PM
898
points
223
comments
18
min read
LW
link
2
reviews
What an actually pessimistic containment strategy looks like
lc
Apr 5, 2022, 12:19 AM
679
points
138
comments
6
min read
LW
link
2
reviews
Simulators
janus
Sep 2, 2022, 12:45 PM
631
points
168
comments
41
min read
LW
link
8
reviews
(generative.ink)
Let’s think about slowing down AI
KatjaGrace
Dec 22, 2022, 5:40 PM
551
points
182
comments
38
min read
LW
link
3
reviews
(aiimpacts.org)
The Redaction Machine
Ben
Sep 20, 2022, 10:03 PM
503
points
48
comments
27
min read
LW
link
1
review
Luck based medicine: my resentful story of becoming a medical miracle
Elizabeth
Oct 16, 2022, 5:40 PM
488
points
121
comments
12
min read
LW
link
3
reviews
(acesounderglass.com)
Losing the root for the tree
Adam Zerner
Sep 20, 2022, 4:53 AM
480
points
31
comments
9
min read
LW
link
1
review
Counter-theses on Sleep
Natália
Mar 21, 2022, 11:21 PM
447
points
135
comments
15
min read
LW
link
1
review
It’s Probably Not Lithium
Natália
Jun 28, 2022, 9:24 PM
442
points
187
comments
28
min read
LW
link
1
review
chinchilla’s wild implications
nostalgebraist
Jul 31, 2022, 1:18 AM
424
points
128
comments
10
min read
LW
link
1
review
(My understanding of) What Everyone in Technical Alignment is Doing and Why
Thomas Larsen
and
elifland
Aug 29, 2022, 1:23 AM
413
points
90
comments
37
min read
LW
link
1
review
You Are Not Measuring What You Think You Are Measuring
johnswentworth
Sep 20, 2022, 8:04 PM
407
points
44
comments
8
min read
LW
link
2
reviews
It Looks Like You’re Trying To Take Over The World
gwern
Mar 9, 2022, 4:35 PM
407
points
120
comments
1
min read
LW
link
1
review
(www.gwern.net)
DeepMind alignment team opinions on AGI ruin arguments
Vika
Aug 12, 2022, 9:06 PM
395
points
37
comments
14
min read
LW
link
1
review
Reflections on six months of fatherhood
jasoncrawford
Jan 31, 2022, 5:28 AM
387
points
24
comments
4
min read
LW
link
1
review
(jasoncrawford.org)
Lies Told To Children
Eliezer Yudkowsky
Apr 14, 2022, 11:25 AM
381
points
94
comments
7
min read
LW
link
1
review
Reward is not the optimization target
TurnTrout
Jul 25, 2022, 12:03 AM
375
points
123
comments
10
min read
LW
link
3
reviews
A Mechanistic Interpretability Analysis of Grokking
Neel Nanda
and
Tom Lieberum
Aug 15, 2022, 2:41 AM
373
points
48
comments
36
min read
LW
link
1
review
(colab.research.google.com)
Counterarguments to the basic AI x-risk case
KatjaGrace
Oct 14, 2022, 1:00 PM
371
points
124
comments
34
min read
LW
link
1
review
(aiimpacts.org)
Without specific countermeasures, the easiest path to transformative AI likely leads to AI takeover
Ajeya Cotra
Jul 18, 2022, 7:06 PM
368
points
95
comments
75
min read
LW
link
1
review
Accounting For College Costs
johnswentworth
Apr 1, 2022, 5:28 PM
366
points
41
comments
7
min read
LW
link
Security Mindset: Lessons from 20+ years of Software Security Failures Relevant to AGI Alignment
elspood
Jun 21, 2022, 11:55 PM
362
points
42
comments
7
min read
LW
link
1
review
Staring into the abyss as a core life skill
benkuhn
Dec 22, 2022, 3:30 PM
354
points
22
comments
12
min read
LW
link
1
review
(www.benkuhn.net)
MIRI announces new “Death With Dignity” strategy
Eliezer Yudkowsky
Apr 2, 2022, 12:43 AM
354
points
546
comments
18
min read
LW
link
1
review
What DALL-E 2 can and cannot do
Swimmer963 (Miranda Dixon-Luinenburg)
May 1, 2022, 11:51 PM
353
points
303
comments
9
min read
LW
link
Beware boasting about non-existent forecasting track records
Jotto999
May 20, 2022, 7:20 PM
338
points
112
comments
5
min read
LW
link
1
review
What should you change in response to an “emergency”? And AI risk
AnnaSalamon
Jul 18, 2022, 1:11 AM
337
points
60
comments
6
min read
LW
link
1
review
Why I think strong general AI is coming soon
porby
Sep 28, 2022, 5:40 AM
336
points
141
comments
34
min read
LW
link
1
review
Looking back on my alignment PhD
TurnTrout
Jul 1, 2022, 3:19 AM
334
points
66
comments
11
min read
LW
link
Optimality is the tiger, and agents are its teeth
Veedrac
Apr 2, 2022, 12:46 AM
327
points
44
comments
16
min read
LW
link
1
review
Models Don’t “Get Reward”
Sam Ringer
Dec 30, 2022, 10:37 AM
313
points
61
comments
5
min read
LW
link
1
review
On how various plans miss the hard bits of the alignment challenge
So8res
Jul 12, 2022, 2:49 AM
313
points
89
comments
29
min read
LW
link
3
reviews
Six Dimensions of Operational Adequacy in AGI Projects
Eliezer Yudkowsky
May 30, 2022, 5:00 PM
310
points
66
comments
13
min read
LW
link
1
review
Epistemic Legibility
Elizabeth
Feb 9, 2022, 6:10 PM
309
points
30
comments
20
min read
LW
link
1
review
(acesounderglass.com)
Why Agent Foundations? An Overly Abstract Explanation
johnswentworth
Mar 25, 2022, 11:17 PM
302
points
58
comments
8
min read
LW
link
1
review
A challenge for AGI organizations, and a challenge for readers
Rob Bensinger
and
Eliezer Yudkowsky
Dec 1, 2022, 11:11 PM
302
points
33
comments
2
min read
LW
link
Two-year update on my personal AI timelines
Ajeya Cotra
Aug 2, 2022, 11:07 PM
293
points
60
comments
16
min read
LW
link
What Are You Tracking In Your Head?
johnswentworth
Jun 28, 2022, 7:30 PM
287
points
83
comments
4
min read
LW
link
1
review
Mysteries of mode collapse
janus
Nov 8, 2022, 10:37 AM
284
points
57
comments
14
min read
LW
link
1
review
Sazen
Duncan Sabien (Deactivated)
Dec 21, 2022, 7:54 AM
281
points
83
comments
12
min read
LW
link
2
reviews
We Choose To Align AI
johnswentworth
Jan 1, 2022, 8:06 PM
280
points
16
comments
3
min read
LW
link
1
review
Don’t die with dignity; instead play to your outs
Jeffrey Ladish
Apr 6, 2022, 7:53 AM
280
points
60
comments
5
min read
LW
link
Is AI Progress Impossible To Predict?
alyssavance
May 15, 2022, 6:30 PM
277
points
39
comments
2
min read
LW
link
A central AI alignment problem: capabilities generalization, and the sharp left turn
So8res
Jun 15, 2022, 1:10 PM
272
points
55
comments
10
min read
LW
link
1
review
Toni Kurz and the Insanity of Climbing Mountains
GeneSmith
Jul 3, 2022, 8:51 PM
271
points
67
comments
11
min read
LW
link
2
reviews
Humans are very reliable agents
alyssavance
Jun 16, 2022, 10:02 PM
269
points
35
comments
3
min read
LW
link
12 interesting things I learned studying the discovery of nature’s laws
Ben Pace
Feb 19, 2022, 11:39 PM
268
points
40
comments
9
min read
LW
link
1
review
Comment reply: my low-quality thoughts on why CFAR didn’t get farther with a “real/efficacious art of rationality”
AnnaSalamon
Jun 9, 2022, 2:12 AM
261
points
63
comments
17
min read
LW
link
1
review
Changing the world through slack & hobbies
Steven Byrnes
Jul 21, 2022, 6:11 PM
261
points
13
comments
10
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel