All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

AllJan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

Safetywashing

Adam SchollJul 1, 2022, 11:56 AM

260 points

20 comments1 min readLW link 2 reviews

So, geez there’s a lot of AI content these days

RaemonOct 6, 2022, 9:32 PM

258 points

140 comments6 min readLW link

Sexual Abuse attitudes might be infohazardous

Pseudonymous OtterJul 19, 2022, 6:06 PM

256 points

72 comments1 min readLW link

The shard theory of human values

Quintin Pope and TurnTrout

Sep 4, 2022, 4:28 AM

255 points

67 comments24 min readLW link 2 reviews

AI alignment is distinct from its near-term applications

paulfchristianoDec 13, 2022, 7:10 AM

255 points

21 comments2 min readLW link

(ai-alignment.com)

New Scaling Laws for Large Language Models

1a3ornApr 1, 2022, 8:41 PM

246 points

22 comments5 min readLW link

How “Discovering Latent Knowledge in Language Models Without Supervision” Fits Into a Broader Alignment Scheme

CollinDec 15, 2022, 6:22 PM

244 points

39 comments16 min readLW link 1 review

A Quick Guide to Confronting Doom

RubyApr 13, 2022, 7:30 PM

243 points

33 comments2 min readLW link

Jailbreaking ChatGPT on Release Day

ZviDec 2, 2022, 1:10 PM

242 points

77 comments6 min readLW link 1 review

(thezvi.wordpress.com)

Slow motion videos as AI risk intuition pumps

Andrew_CritchJun 14, 2022, 7:31 PM

241 points

41 comments2 min readLW link 1 review

The Plan − 2022 Update

johnswentworthDec 1, 2022, 8:43 PM

239 points

37 comments8 min readLW link 1 review

Common misconceptions about OpenAI

Jacob_HiltonAug 25, 2022, 2:02 PM

237 points

154 comments5 min readLW link 1 review

Contra Hofstadter on GPT-3 Nonsense

ricticJun 15, 2022, 9:53 PM

237 points

24 comments2 min readLW link

Introduction to abstract entropy

Alex_AltairOct 20, 2022, 9:03 PM

237 points

78 comments18 min readLW link 1 review

An Observation of Vavilov Day

ElizabethJan 3, 2022, 9:10 PM

236 points

42 comments3 min readLW link

(acesounderglass.com)

Announcing Balsa Research

ZviSep 25, 2022, 10:50 PM

235 points

64 comments2 min readLW link 1 review

(thezvi.wordpress.com)

ProjectLawful.com: Eliezer’s latest story, past 1M words

Eliezer YudkowskyMay 11, 2022, 6:18 AM

234 points

112 comments1 min readLW link 4 reviews

Editing Advice for LessWrong Users

JustisMillsApr 11, 2022, 4:32 PM

233 points

14 comments6 min readLW link 1 review

(briefly) RaDVaC and SMTM, two things we should be doing

Eliezer YudkowskyJan 12, 2022, 6:20 AM

230 points

79 comments3 min readLW link 1 review

AGI Safety FAQ / all-dumb-questions-allowed thread

Aryeh EnglanderJun 7, 2022, 5:47 AM

227 points

526 comments4 min readLW link

Moses and the Class Struggle

lsusrApr 1, 2022, 11:55 AM

225 points

26 comments5 min readLW link

Replacing Karma with Good Heart Tokens (Worth $1!)

Ben Pace and habryka

Apr 1, 2022, 9:31 AM

225 points

173 comments4 min readLW link

How I buy things when Lightcone wants them fast

Bird ConceptSep 26, 2022, 5:02 AM

224 points

21 comments8 min readLW link

What do ML researchers think about AI in 2022?

KatjaGraceAug 4, 2022, 3:40 PM

221 points

33 comments3 min readLW link

(aiimpacts.org)

Lessons learned from talking to >100 academics about AI safety

Marius HobbhahnOct 10, 2022, 1:16 PM

216 points

18 comments12 min readLW link 1 review

Humans provide an untapped wealth of evidence about alignment

TurnTrout and Quintin Pope

Jul 14, 2022, 2:31 AM

212 points

94 comments9 min readLW link 1 review

Unifying Bargaining Notions (1/2)

DiffractorJul 25, 2022, 12:28 AM

210 points

41 comments16 min readLW link

How To Go From Interpretability To Alignment: Just Retarget The Search

johnswentworthAug 10, 2022, 4:08 PM

209 points

34 comments3 min readLW link 1 review

Visible Homelessness in SF: A Quick Breakdown of Causes

alyssavanceMay 25, 2022, 1:40 AM

209 points

32 comments2 min readLW link

What does it take to defend the world against out-of-control AGIs?

Steven ByrnesOct 25, 2022, 2:47 PM

208 points

49 comments30 min readLW link 1 review

Worlds Where Iterative Design Fails

johnswentworthAug 30, 2022, 8:48 PM

208 points

30 comments10 min readLW link 1 review

What it’s like to dissect a cadaver

Alok SinghNov 10, 2022, 6:40 AM

208 points

24 comments5 min readLW link

(alok.github.io)

Benign Boundary Violations

Duncan Sabien (Deactivated)May 26, 2022, 6:48 AM

207 points

84 comments18 min readLW link 1 review

Call For Distillers

johnswentworthApr 4, 2022, 6:25 PM

207 points

43 comments3 min readLW link 1 review

Causal Scrubbing: a method for rigorously testing interpretability hypotheses [Redwood Research]

LawrenceC, Adrià Garriga-alonso, Nicholas Goldowsky-Dill, ryan_greenblatt, jenny, Ansh Radhakrishnan, Buck and Nate Thomas

Dec 3, 2022, 12:58 AM

206 points

35 comments20 min readLW link 1 review

I Converted Book I of The Sequences Into A Zoomer-Readable Format

dkirmaniNov 10, 2022, 2:59 AM

200 points

32 comments2 min readLW link

Brain Efficiency: Much More than You Wanted to Know

jacob_cannellJan 6, 2022, 3:38 AM

200 points

103 comments29 min readLW link

Butterfly Ideas

ElizabethFeb 22, 2022, 7:40 AM

200 points

10 comments3 min readLW link 2 reviews

(acesounderglass.com)

A concrete bet offer to those with short AGI timelines

Matthew Barnett and Tamay

Apr 9, 2022, 9:41 PM

199 points

120 comments5 min readLW link

The Singular Value Decompositions of Transformer Weight Matrices are Highly Interpretable

beren and Sid Black

Nov 28, 2022, 12:54 PM

199 points

33 comments31 min readLW link

Do a cost-benefit analysis of your technology usage

TurnTroutMar 27, 2022, 11:09 PM

198 points

53 comments13 min readLW link

We Are Conjecture, A New Alignment Research Startup

Connor LeahyApr 8, 2022, 11:40 AM

197 points

25 comments4 min readLW link

A note about differential technological development

So8resJul 15, 2022, 4:46 AM

197 points

33 comments6 min readLW link

Connor Leahy on Dying with Dignity, EleutherAI and Conjecture

Michaël TrazziJul 22, 2022, 6:44 PM

195 points

29 comments14 min readLW link

(theinsideview.ai)

How my team at Lightcone sometimes gets stuff done

Bird ConceptSep 19, 2022, 5:47 AM

192 points

43 comments7 min readLW link 1 review

On saving one’s world

Rob BensingerMay 17, 2022, 7:53 PM

192 points

4 comments1 min readLW link

Tyranny of the Epistemic Majority

Scott GarrabrantNov 22, 2022, 5:19 PM

192 points

13 comments9 min readLW link 1 review

Deliberate Grieving

RaemonMay 30, 2022, 8:49 PM

188 points

16 comments9 min readLW link 2 reviews

Intro to Naturalism: Orientation

LoganStrohl and Duncan Sabien (Deactivated)

Feb 13, 2022, 7:52 AM

187 points

23 comments7 min readLW link 2 reviews

Have You Tried Hiring People?

rank-biserialMar 2, 2022, 2:06 AM

185 points

117 comments8 min readLW link 1 review

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer