AGI Ruin: A List of Lethalities

Eliezer Yudkowsky · Jun 5, 2022, 10:05 PM
936 points
708 comments · 30 min read · LW link · 3 reviews

Where I agree and disagree with Eliezer

paulfchristiano · Jun 19, 2022, 7:15 PM
899 points
223 comments · 18 min read · LW link · 2 reviews

It’s Probably Not Lithium

Natália · Jun 28, 2022, 9:24 PM
442 points
187 comments · 28 min read · LW link · 1 review

Security Mindset: Lessons from 20+ years of Software Security Failures Relevant to AGI Alignment

elspood · Jun 21, 2022, 11:55 PM
362 points
42 comments · 7 min read · LW link · 1 review

What Are You Tracking In Your Head?

johnswentworth · Jun 28, 2022, 7:30 PM
289 points
83 comments · 4 min read · LW link · 1 review

A central AI alignment problem: capabilities generalization, and the sharp left turn

So8res · Jun 15, 2022, 1:10 PM
272 points
55 comments · 10 min read · LW link · 1 review

Humans are very reliable agents

alyssavance · Jun 16, 2022, 10:02 PM
269 points
35 comments · 3 min read · LW link

Comment reply: my low-quality thoughts on why CFAR didn’t get farther with a “real/efficacious art of rationality”

AnnaSalamon · Jun 9, 2022, 2:12 AM
261 points
63 comments · 17 min read · LW link · 1 review

Slow motion videos as AI risk intuition pumps

Andrew_Critch · Jun 14, 2022, 7:31 PM
241 points
41 comments · 2 min read · LW link · 1 review

Contra Hofstadter on GPT-3 Nonsense

rictic · Jun 15, 2022, 9:53 PM
237 points
24 comments · 2 min read · LW link

AGI Safety FAQ / all-dumb-questions-allowed thread

Aryeh Englander · Jun 7, 2022, 5:47 AM
227 points
526 comments · 4 min read · LW link

The prototypical catastrophic AI action is getting root access to its datacenter

Buck · Jun 2, 2022, 11:46 PM
180 points
13 comments · 2 min read · LW link · 1 review

The inordinately slow spread of good AGI conversations in ML

Rob Bensinger · Jun 21, 2022, 4:09 PM
173 points
62 comments · 8 min read · LW link

Announcing the Inverse Scaling Prize ($250k Prize Pool)

Jun 27, 2022, 3:58 PM
171 points
14 comments · 7 min read · LW link

AI Could Defeat All Of Us Combined

HoldenKarnofsky · Jun 9, 2022, 3:50 PM
170 points
42 comments · 17 min read · LW link
(www.cold-takes.com)

On A List of Lethalities

Zvi · Jun 13, 2022, 12:30 PM
165 points
50 comments · 54 min read · LW link · 1 review
(thezvi.wordpress.com)

A transparency and interpretability tech tree

evhub · Jun 16, 2022, 11:44 PM
163 points
11 comments · 18 min read · LW link · 1 review

Deep Learning Systems Are Not Less Interpretable Than Logic/Probability/Etc

johnswentworth · Jun 4, 2022, 5:41 AM
159 points
55 comments · 2 min read · LW link · 1 review

Godzilla Strategies

johnswentworth · Jun 11, 2022, 3:44 PM
159 points
72 comments · 3 min read · LW link

Why all the fuss about recursive self-improvement?

So8res · Jun 12, 2022, 8:53 PM
158 points
62 comments · 7 min read · LW link · 1 review

Limits to Legibility

Jan_Kulveit · Jun 29, 2022, 5:42 PM
157 points
11 comments · 5 min read · LW link · 1 review

Nonprofit Boards are Weird

HoldenKarnofsky · Jun 23, 2022, 2:40 PM
156 points
26 comments · 20 min read · LW link · 1 review
(www.cold-takes.com)

LessWrong Has Agree/Disagree Voting On All New Comment Threads

Ben Pace · Jun 24, 2022, 12:43 AM
154 points
217 comments · 2 min read · LW link · 1 review

Staying Split: Sabatini and Social Justice

Duncan Sabien (Deactivated) · Jun 8, 2022, 8:32 AM
153 points
28 comments · 21 min read · LW link

Steam

abramdemski · Jun 20, 2022, 5:38 PM
149 points
13 comments · 5 min read · LW link · 1 review

[Question] why assume AGIs will optimize for fixed goals?

nostalgebraist · Jun 10, 2022, 1:28 AM
147 points
60 comments · 4 min read · LW link · 2 reviews

Public beliefs vs. Private beliefs

Eli Tyre · Jun 1, 2022, 9:33 PM
144 points
30 comments · 5 min read · LW link

A descriptive, not prescriptive, overview of current AI Alignment Research

Jun 6, 2022, 9:59 PM
139 points
21 comments · 7 min read · LW link

Announcing the LessWrong Curated Podcast

Jun 22, 2022, 10:16 PM
137 points
27 comments · 1 min read · LW link

AI-Written Critiques Help Humans Notice Flaws

paulfchristiano · Jun 25, 2022, 5:22 PM
137 points
5 comments · 3 min read · LW link
(openai.com)

Contra EY: Can AGI destroy us without trial & error?

nsokolsky · Jun 13, 2022, 6:26 PM
137 points
72 comments · 15 min read · LW link

Will Capabilities Generalise More?

Ramana Kumar · Jun 29, 2022, 5:12 PM
133 points
39 comments · 4 min read · LW link

Intergenerational trauma impeding cooperative existential safety efforts

Andrew_Critch · Jun 3, 2022, 8:13 AM
129 points
29 comments · 3 min read · LW link

Confused why a “capabilities research is good for alignment progress” position isn’t discussed more

Kaj_Sotala · Jun 2, 2022, 9:41 PM
129 points
27 comments · 4 min read · LW link

“Pivotal Acts” means something specific

Raemon · Jun 7, 2022, 9:56 PM
127 points
23 comments · 2 min read · LW link

Let’s See You Write That Corrigibility Tag

Eliezer Yudkowsky · Jun 19, 2022, 9:11 PM
124 points
70 comments · 1 min read · LW link

Scott Aaronson is joining OpenAI to work on AI safety

peterbarnett · Jun 18, 2022, 4:06 AM
117 points
31 comments · 1 min read · LW link
(scottaaronson.blog)

CFAR Handbook: Introduction

CFAR!Duncan · Jun 28, 2022, 4:53 PM
116 points
12 comments · 1 min read · LW link

Leaving Google, Joining the Nucleic Acid Observatory

jefftk · Jun 10, 2022, 5:00 PM
114 points
4 comments · 3 min read · LW link
(www.jefftk.com)

Conversation with Eliezer: What do you want the system to do?

Orpheus16 · Jun 25, 2022, 5:36 PM
114 points
38 comments · 2 min read · LW link

Who models the models that model models? An exploration of GPT-3’s in-context model fitting ability

Lovre · Jun 7, 2022, 7:37 PM
112 points
16 comments · 9 min read · LW link

Relationship Advice Repository

Ruby · Jun 20, 2022, 2:39 PM
109 points
36 comments · 38 min read · LW link

wrapper-minds are the enemy

nostalgebraist · Jun 17, 2022, 1:58 AM
104 points
43 comments · 8 min read · LW link

Yes, AI research will be substantially curtailed if a lab causes a major disaster

lc · Jun 14, 2022, 10:17 PM
103 points
31 comments · 2 min read · LW link

The Mountain Troll

lsusr · Jun 11, 2022, 9:14 AM
103 points
26 comments · 2 min read · LW link

Units of Exchange

CFAR!Duncan · Jun 28, 2022, 4:53 PM
99 points
28 comments · 11 min read · LW link

Pivotal outcomes and pivotal processes

Andrew_Critch · Jun 17, 2022, 11:43 PM
97 points
31 comments · 4 min read · LW link

Announcing Epoch: A research organization investigating the road to Transformative AI

Jun 27, 2022, 1:55 PM
97 points
2 comments · 2 min read · LW link
(epochai.org)

My current take on Internal Family Systems “parts”

Kaj_Sotala · Jun 26, 2022, 5:40 PM
96 points
11 comments · 3 min read · LW link
(kajsotala.fi)

Contest: An Alien Message

DaemonicSigil · Jun 27, 2022, 5:54 AM
95 points
100 comments · 1 min read · LW link