AGI Ruin: A List of Lethalities

Eliezer Yudkowsky5 Jun 2022 22:05 UTC
908 points
701 comments30 min readLW link3 reviews

Where I agree and dis­agree with Eliezer

paulfchristiano19 Jun 2022 19:15 UTC
888 points
220 comments18 min readLW link2 reviews

Eight Short Stud­ies On Excuses

Scott Alexander20 Apr 2010 23:01 UTC
834 points
253 comments10 min readLW link

Preface

Eliezer Yudkowsky11 Mar 2015 19:00 UTC
779 points
15 comments4 min readLW link

The Best Text­books on Every Subject

lukeprog16 Jan 2011 8:30 UTC
746 points
411 comments7 min readLW link

SolidGoldMag­ikarp (plus, prompt gen­er­a­tion)

5 Feb 2023 22:02 UTC
676 points
205 comments12 min readLW link

What an ac­tu­ally pes­simistic con­tain­ment strat­egy looks like

lc5 Apr 2022 0:19 UTC
675 points
138 comments6 min readLW link2 reviews

The Waluigi Effect (mega-post)

Cleo Nardo3 Mar 2023 3:22 UTC
628 points
187 comments16 min readLW link

Simulators

janus2 Sep 2022 12:45 UTC
609 points
162 comments41 min readLW link8 reviews
(generative.ink)

Ra­tion­al­ism be­fore the Sequences

Eric Raymond30 Mar 2021 14:04 UTC
593 points
83 comments11 min readLW link2 reviews

Schel­ling fences on slip­pery slopes

Scott Alexander16 Mar 2012 23:44 UTC
592 points
250 comments6 min readLW link

Mak­ing Vaccine

johnswentworth3 Feb 2021 20:24 UTC
577 points
249 comments6 min readLW link3 reviews

LessWrong’s (first) album: I Have Been A Good Bing

1 Apr 2024 7:33 UTC
564 points
174 comments11 min readLW link

Hu­mans are not au­to­mat­i­cally strategic

AnnaSalamon8 Sep 2010 7:02 UTC
549 points
277 comments4 min readLW link

Let’s think about slow­ing down AI

KatjaGrace22 Dec 2022 17:40 UTC
549 points
182 comments38 min readLW link3 reviews
(aiimpacts.org)

Pain is not the unit of Effort

alkjash24 Nov 2020 20:00 UTC
545 points
90 comments5 min readLW link2 reviews
(radimentary.wordpress.com)

Diseased think­ing: dis­solv­ing ques­tions about disease

Scott Alexander30 May 2010 21:16 UTC
527 points
356 comments9 min readLW link

What 2026 looks like

Daniel Kokotajlo6 Aug 2021 16:14 UTC
520 points
155 comments16 min readLW link1 review

The Redac­tion Machine

Ben20 Sep 2022 22:03 UTC
500 points
48 comments27 min readLW link1 review

Rea­son as memetic im­mune disorder

PhilGoetz19 Sep 2009 21:05 UTC
498 points
185 comments5 min readLW link

The Talk: a brief ex­pla­na­tion of sex­ual dimorphism

Malmesbury18 Sep 2023 16:23 UTC
492 points
72 comments16 min readLW link

Luck based medicine: my re­sent­ful story of be­com­ing a med­i­cal miracle

Elizabeth16 Oct 2022 17:40 UTC
483 points
121 comments12 min readLW link3 reviews
(acesounderglass.com)

Los­ing the root for the tree

Adam Zerner20 Sep 2022 4:53 UTC
474 points
31 comments9 min readLW link1 review

Mak­ing Beliefs Pay Rent (in An­ti­ci­pated Ex­pe­riences)

Eliezer Yudkowsky28 Jul 2007 22:59 UTC
473 points
267 comments4 min readLW link

How much do you be­lieve your re­sults?

Eric Neyman6 May 2023 20:31 UTC
473 points
14 comments15 min readLW link
(ericneyman.wordpress.com)

OpenAI Email Archives (from Musk v. Alt­man)

habryka16 Nov 2024 6:38 UTC
463 points
58 comments32 min readLW link

Sig­nifi­cantly En­hanc­ing Adult In­tel­li­gence With Gene Edit­ing May Be Possible

12 Dec 2023 18:14 UTC
451 points
170 comments33 min readLW link

Be­ing the (Pareto) Best in the World

johnswentworth24 Jun 2019 18:36 UTC
448 points
60 comments3 min readLW link3 reviews

How To Write Quickly While Main­tain­ing Epistemic Rigor

johnswentworth28 Aug 2021 17:52 UTC
447 points
38 comments4 min readLW link3 reviews

100 Tips for a Bet­ter Life

Ideopunk22 Dec 2020 14:30 UTC
446 points
130 comments9 min readLW link1 review

Counter-the­ses on Sleep

Natália21 Mar 2022 23:21 UTC
444 points
131 comments15 min readLW link1 review

It’s Prob­a­bly Not Lithium

Natália28 Jun 2022 21:24 UTC
442 points
187 comments28 min readLW link1 review

Gen­er­al­iz­ing From One Example

Scott Alexander28 Apr 2009 22:00 UTC
437 points
423 comments6 min readLW link

Steer­ing GPT-2-XL by adding an ac­ti­va­tion vector

13 May 2023 18:42 UTC
436 points
97 comments50 min readLW link

I would have shit in that alley, too

Declan Molony18 Jun 2024 4:41 UTC
432 points
134 comments4 min readLW link

Wel­come to LessWrong!

14 Jun 2019 19:42 UTC
431 points
60 comments2 min readLW link

The ants and the grasshopper

Richard_Ngo4 Jun 2023 22:00 UTC
428 points
35 comments5 min readLW link
(www.narrativeark.xyz)

Dou­glas Hofs­tadter changes his mind on Deep Learn­ing & AI risk (June 2023)?

gwern3 Jul 2023 0:48 UTC
425 points
54 comments7 min readLW link
(www.youtube.com)

Fo­cus on the places where you feel shocked ev­ery­one’s drop­ping the ball

So8res2 Feb 2023 0:27 UTC
421 points
60 comments4 min readLW link

chin­chilla’s wild implications

nostalgebraist31 Jul 2022 1:18 UTC
420 points
128 comments10 min readLW link1 review

The non­cen­tral fal­lacy—the worst ar­gu­ment in the world?

Scott Alexander27 Aug 2012 3:36 UTC
418 points
1,768 comments7 min readLW link

Bets, Bonds, and Kindergarteners

jefftk3 Jan 2021 21:20 UTC
418 points
35 comments2 min readLW link1 review
(www.jefftk.com)

What failure looks like

paulfchristiano17 Mar 2019 20:18 UTC
416 points
54 comments8 min readLW link2 reviews

Things I Learned by Spend­ing Five Thou­sand Hours In Non-EA Charities

jenn1 Jun 2023 20:48 UTC
414 points
34 comments8 min readLW link
(jenn.site)

(My un­der­stand­ing of) What Every­one in Tech­ni­cal Align­ment is Do­ing and Why

29 Aug 2022 1:23 UTC
413 points
90 comments37 min readLW link1 review

Trans­form­ers Rep­re­sent Belief State Geom­e­try in their Resi­d­ual Stream

Adam Shai16 Apr 2024 21:16 UTC
411 points
100 comments12 min readLW link

It Looks Like You’re Try­ing To Take Over The World

gwern9 Mar 2022 16:35 UTC
406 points
120 comments1 min readLW link1 review
(www.gwern.net)

GPTs are Pre­dic­tors, not Imitators

Eliezer Yudkowsky8 Apr 2023 19:59 UTC
403 points
91 comments3 min readLW link

Failures in Kindness

silentbob26 Mar 2024 21:30 UTC
401 points
60 comments9 min readLW link

Ugh fields

Roko12 Apr 2010 17:06 UTC
401 points
81 comments3 min readLW link