All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

AllJan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

OpenAI: Facts from a Weekend

ZviNov 20, 2023, 3:30 PM

271 points

165 comments9 min readLW link

(thezvi.wordpress.com)

Dear Self; we need to talk about ambition

ElizabethAug 27, 2023, 11:10 PM

270 points

28 comments8 min readLW link 2 reviews

(acesounderglass.com)

The Base Rate Times, news through prediction markets

vandemonianJun 6, 2023, 5:42 PM

268 points

41 comments4 min readLW link 1 review

Pausing AI Developments Isn’t Enough. We Need to Shut it All Down

Eliezer YudkowskyApr 8, 2023, 12:36 AM

268 points

44 comments12 min readLW link 1 review

My May 2023 priorities for AI x-safety: more empathy, more unification of concerns, and less vilification of OpenAI

Andrew_CritchMay 24, 2023, 12:02 AM

268 points

39 comments8 min readLW link

Discussion with Nate Soares on a key alignment difficulty

HoldenKarnofskyMar 13, 2023, 9:20 PM

265 points

43 comments22 min readLW link 1 review

Constellations are Younger than Continents

Jeffrey HeningerDec 19, 2023, 6:12 AM

263 points

21 comments2 min readLW link

[SEE NEW EDITS] No, You Need to Write Clearer

Nicholas / Heather KrossApr 29, 2023, 5:04 AM

262 points

65 comments5 min readLW link

(www.thinkingmuchbetter.com)

“Carefully Bootstrapped Alignment” is organizationally hard

RaemonMar 17, 2023, 6:00 PM

262 points

23 comments11 min readLW link 1 review

UFO Betting: Put Up or Shut Up

RatsWrongAboutUAPJun 13, 2023, 4:05 AM

259 points

216 comments2 min readLW link 1 review

Alignment Implications of LLM Successes: a Debate in One Act

Zack_M_DavisOct 21, 2023, 3:22 PM

258 points

55 comments13 min readLW link 2 reviews

My Model Of EA Burnout

LoganStrohlJan 25, 2023, 5:52 PM

258 points

50 comments5 min readLW link 1 review

Mental Health and the Alignment Problem: A Compilation of Resources (updated April 2023)

Chris Scammell and DivineMango

May 10, 2023, 7:04 PM

256 points

54 comments21 min readLW link

Thoughts on the impact of RLHF research

paulfchristianoJan 25, 2023, 5:23 PM

253 points

102 comments9 min readLW link

You Don’t Exist, Duncan

Duncan Sabien (Deactivated)Feb 2, 2023, 8:37 AM

252 points

107 comments9 min readLW link

Deep Deceptiveness

So8resMar 21, 2023, 2:51 AM

251 points

60 comments14 min readLW link 1 review

My Assessment of the Chinese AI Safety Community

Lao MeinApr 25, 2023, 4:21 AM

250 points

94 comments3 min readLW link

My views on “doom”

paulfchristianoApr 27, 2023, 5:50 PM

250 points

37 comments2 min readLW link 1 review

(ai-alignment.com)

Yes, It’s Subjective, But Why All The Crabs?

johnswentworthJul 28, 2023, 7:35 PM

250 points

15 comments6 min readLW link

I hired 5 people to sit behind me and make me productive for a month

Simon BerensFeb 5, 2023, 1:19 AM

249 points

83 comments10 min readLW link

(www.simonberens.com)

On AutoGPT

ZviApr 13, 2023, 12:30 PM

248 points

47 comments20 min readLW link

(thezvi.wordpress.com)

Lessons On How To Get Things Right On The First Try

johnswentworth and David Lorell

Jun 19, 2023, 11:58 PM

245 points

57 comments10 min readLW link 1 review

Munk AI debate: confusions and possible cruxes

Steven ByrnesJun 27, 2023, 2:18 PM

244 points

21 comments8 min readLW link

Book Review: Going Infinite

ZviOct 24, 2023, 3:00 PM

242 points

113 comments97 min readLW link 1 review

(thezvi.wordpress.com)

Natural Abstractions: Key claims, Theorems, and Critiques

LawrenceC, Leon Lang and Erik Jenner

Mar 16, 2023, 4:37 PM

241 points

23 comments45 min readLW link 3 reviews

Sum-threshold attacks

TsviBTSep 8, 2023, 5:13 PM

238 points

55 comments10 min readLW link

(tsvibt.blogspot.com)

Self-driving car bets

paulfchristianoJul 29, 2023, 6:10 PM

236 points

44 comments5 min readLW link

(sideways-view.com)

AI Control: Improving Safety Despite Intentional Subversion

Buck, Fabien Roger, ryan_greenblatt and Kshitij Sachan

Dec 13, 2023, 3:51 PM

236 points

24 comments10 min readLW link 4 reviews

Cultivating a state of mind where new ideas are born

Henrik KarlssonJul 27, 2023, 9:16 AM

235 points

21 comments14 min readLW link 2 reviews

(www.henrikkarlsson.xyz)

More information about the dangerous capability evaluations we did with GPT-4 and Claude.

Beth BarnesMar 19, 2023, 12:25 AM

233 points

54 comments8 min readLW link

(evals.alignment.org)

Policy discussions follow strong contextualizing norms

Richard_NgoApr 1, 2023, 11:51 PM

230 points

61 comments3 min readLW link

What are the results of more parental supervision and less outdoor play?

juliawiseNov 25, 2023, 12:52 PM

228 points

31 comments5 min readLW link

AGI in sight: our look at the game board

Andrea_Miotti and Gabriel Alfour

Feb 18, 2023, 10:17 PM

227 points

135 comments6 min readLW link

(andreamiotti.substack.com)

Ways I Expect AI Regulation To Increase Extinction Risk

1a3ornJul 4, 2023, 5:32 PM

225 points

32 comments7 min readLW link

Elements of Rationalist Discourse

Rob BensingerFeb 12, 2023, 7:58 AM

224 points

49 comments3 min readLW link 1 review

Recursive Middle Manager Hell

RaemonJan 1, 2023, 4:33 AM

224 points

46 comments11 min readLW link 1 review

Announcing MIRI’s new CEO and leadership team

Gretta DulebaOct 10, 2023, 7:22 PM

222 points

52 comments3 min readLW link

Thoughts on responsible scaling policies and regulation

paulfchristianoOct 24, 2023, 10:21 PM

221 points

33 comments6 min readLW link

Catching the Eye of Sauron

Casey B.Apr 7, 2023, 12:40 AM

221 points

68 comments4 min readLW link

What I would do if I wasn’t at ARC Evals

LawrenceCSep 5, 2023, 7:19 PM

220 points

10 comments13 min readLW link 1 review

UDT shows that decision theory is more puzzling than ever

Wei DaiSep 13, 2023, 12:26 PM

218 points

56 comments1 min readLW link

AI presidents discuss AI alignment agendas

TurnTrout and Garrett Baker

Sep 9, 2023, 6:55 PM

217 points

23 comments1 min readLW link

(www.youtube.com)

Orthogonal: A new agent foundations alignment organization

Tamsin LeakeApr 19, 2023, 8:17 PM

217 points

4 comments1 min readLW link

(orxl.org)

Enemies vs Malefactors

So8resFeb 28, 2023, 11:38 PM

217 points

69 comments LW link 4 reviews

Announcing Apollo Research

Marius Hobbhahn, beren, Lee Sharkey, Lucius Bushnaq, Dan Braun, Mikita Balesni and Jérémy Scheurer

May 30, 2023, 4:17 PM

217 points

11 comments8 min readLW link

Eliezer Yudkowsky’s Letter in Time Magazine

ZviApr 5, 2023, 6:00 PM

214 points

86 comments14 min readLW link

(thezvi.wordpress.com)

Updates and Reflections on Optimal Exercise after Nearly a Decade

romeostevensitJun 8, 2023, 11:02 PM

213 points

57 comments2 min readLW link 1 review

An AI risk argument that resonates with NYTimes readers

Julian BradshawMar 12, 2023, 11:09 PM

212 points

14 comments1 min readLW link

Consciousness as a conflationary alliance term for intrinsically valued internal experiences

Andrew_CritchJul 10, 2023, 8:09 AM

212 points

54 comments11 min readLW link 2 reviews

Actually, Othello-GPT Has A Linear Emergent World Representation

Neel NandaMar 29, 2023, 10:13 PM

211 points

26 comments19 min readLW link

(neelnanda.io)

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer