AGI Ruin: A List of Lethalities

Eliezer Yudkowsky · Jun 5, 2022, 10:05 PM
929 points
708 comments · 30 min read · LW link · 3 reviews

Where I agree and disagree with Eliezer

paulfchristiano · Jun 19, 2022, 7:15 PM
898 points
223 comments · 18 min read · LW link · 2 reviews

What an actually pessimistic containment strategy looks like

lc · Apr 5, 2022, 12:19 AM
679 points
138 comments · 6 min read · LW link · 2 reviews

Simulators

janus · Sep 2, 2022, 12:45 PM
631 points
168 comments · 41 min read · LW link · 8 reviews
(generative.ink)

Let’s think about slowing down AI

KatjaGrace · Dec 22, 2022, 5:40 PM
551 points
182 comments · 38 min read · LW link · 3 reviews
(aiimpacts.org)

The Redaction Machine

Ben · Sep 20, 2022, 10:03 PM
503 points
48 comments · 27 min read · LW link · 1 review

Luck based medicine: my resentful story of becoming a medical miracle

Elizabeth · Oct 16, 2022, 5:40 PM
488 points
121 comments · 12 min read · LW link · 3 reviews
(acesounderglass.com)

Losing the root for the tree

Adam Zerner · Sep 20, 2022, 4:53 AM
480 points
31 comments · 9 min read · LW link · 1 review

Counter-theses on Sleep

Natália · Mar 21, 2022, 11:21 PM
447 points
135 comments · 15 min read · LW link · 1 review

It’s Probably Not Lithium

Natália · Jun 28, 2022, 9:24 PM
442 points
187 comments · 28 min read · LW link · 1 review

chinchilla’s wild implications

nostalgebraist · Jul 31, 2022, 1:18 AM
424 points
128 comments · 10 min read · LW link · 1 review

(My understanding of) What Everyone in Technical Alignment is Doing and Why

Aug 29, 2022, 1:23 AM
413 points
90 comments · 37 min read · LW link · 1 review

You Are Not Measuring What You Think You Are Measuring

johnswentworth · Sep 20, 2022, 8:04 PM
407 points
44 comments · 8 min read · LW link · 2 reviews

It Looks Like You’re Trying To Take Over The World

gwern · Mar 9, 2022, 4:35 PM
407 points
120 comments · 1 min read · LW link · 1 review
(www.gwern.net)

DeepMind alignment team opinions on AGI ruin arguments

Vika · Aug 12, 2022, 9:06 PM
395 points
37 comments · 14 min read · LW link · 1 review

Reflections on six months of fatherhood

jasoncrawford · Jan 31, 2022, 5:28 AM
387 points
24 comments · 4 min read · LW link · 1 review
(jasoncrawford.org)

Lies Told To Children

Eliezer Yudkowsky · Apr 14, 2022, 11:25 AM
381 points
94 comments · 7 min read · LW link · 1 review

Reward is not the optimization target

TurnTrout · Jul 25, 2022, 12:03 AM
375 points
123 comments · 10 min read · LW link · 3 reviews

A Mechanistic Interpretability Analysis of Grokking

Aug 15, 2022, 2:41 AM
373 points
48 comments · 36 min read · LW link · 1 review
(colab.research.google.com)

Counterarguments to the basic AI x-risk case

KatjaGrace · Oct 14, 2022, 1:00 PM
371 points
124 comments · 34 min read · LW link · 1 review
(aiimpacts.org)

Without specific countermeasures, the easiest path to transformative AI likely leads to AI takeover

Ajeya Cotra · Jul 18, 2022, 7:06 PM
368 points
95 comments · 75 min read · LW link · 1 review

Accounting For College Costs

johnswentworth · Apr 1, 2022, 5:28 PM
366 points
41 comments · 7 min read · LW link

Security Mindset: Lessons from 20+ years of Software Security Failures Relevant to AGI Alignment

elspood · Jun 21, 2022, 11:55 PM
362 points
42 comments · 7 min read · LW link · 1 review

Staring into the abyss as a core life skill

benkuhn · Dec 22, 2022, 3:30 PM
354 points
22 comments · 12 min read · LW link · 1 review
(www.benkuhn.net)

MIRI announces new “Death With Dignity” strategy

Eliezer Yudkowsky · Apr 2, 2022, 12:43 AM
354 points
546 comments · 18 min read · LW link · 1 review

What DALL-E 2 can and cannot do

Swimmer963 (Miranda Dixon-Luinenburg) · May 1, 2022, 11:51 PM
353 points
303 comments · 9 min read · LW link

Beware boasting about non-existent forecasting track records

Jotto999 · May 20, 2022, 7:20 PM
338 points
112 comments · 5 min read · LW link · 1 review

What should you change in response to an “emergency”? And AI risk

AnnaSalamon · Jul 18, 2022, 1:11 AM
337 points
60 comments · 6 min read · LW link · 1 review

Why I think strong general AI is coming soon

porby · Sep 28, 2022, 5:40 AM
336 points
141 comments · 34 min read · LW link · 1 review

Looking back on my alignment PhD

TurnTrout · Jul 1, 2022, 3:19 AM
334 points
66 comments · 11 min read · LW link

Optimality is the tiger, and agents are its teeth

Veedrac · Apr 2, 2022, 12:46 AM
327 points
44 comments · 16 min read · LW link · 1 review

Models Don’t “Get Reward”

Sam Ringer · Dec 30, 2022, 10:37 AM
313 points
61 comments · 5 min read · LW link · 1 review

On how various plans miss the hard bits of the alignment challenge

So8res · Jul 12, 2022, 2:49 AM
313 points
89 comments · 29 min read · LW link · 3 reviews

Six Dimensions of Operational Adequacy in AGI Projects

Eliezer Yudkowsky · May 30, 2022, 5:00 PM
310 points
66 comments · 13 min read · LW link · 1 review

Epistemic Legibility

Elizabeth · Feb 9, 2022, 6:10 PM
309 points
30 comments · 20 min read · LW link · 1 review
(acesounderglass.com)

Why Agent Foundations? An Overly Abstract Explanation

johnswentworth · Mar 25, 2022, 11:17 PM
302 points
58 comments · 8 min read · LW link · 1 review

A challenge for AGI organizations, and a challenge for readers

Dec 1, 2022, 11:11 PM
302 points
33 comments · 2 min read · LW link

Two-year update on my personal AI timelines

Ajeya Cotra · Aug 2, 2022, 11:07 PM
293 points
60 comments · 16 min read · LW link

What Are You Tracking In Your Head?

johnswentworth · Jun 28, 2022, 7:30 PM
287 points
83 comments · 4 min read · LW link · 1 review

Mysteries of mode collapse

janus · Nov 8, 2022, 10:37 AM
284 points
57 comments · 14 min read · LW link · 1 review

Sazen

Duncan Sabien (Deactivated) · Dec 21, 2022, 7:54 AM
281 points
83 comments · 12 min read · LW link · 2 reviews

We Choose To Align AI

johnswentworth · Jan 1, 2022, 8:06 PM
280 points
16 comments · 3 min read · LW link · 1 review

Don’t die with dignity; instead play to your outs

Jeffrey Ladish · Apr 6, 2022, 7:53 AM
280 points
60 comments · 5 min read · LW link

Is AI Progress Impossible To Predict?

alyssavance · May 15, 2022, 6:30 PM
277 points
39 comments · 2 min read · LW link

A central AI alignment problem: capabilities generalization, and the sharp left turn

So8res · Jun 15, 2022, 1:10 PM
272 points
55 comments · 10 min read · LW link · 1 review

Toni Kurz and the Insanity of Climbing Mountains

GeneSmith · Jul 3, 2022, 8:51 PM
271 points
67 comments · 11 min read · LW link · 2 reviews

Humans are very reliable agents

alyssavance · Jun 16, 2022, 10:02 PM
269 points
35 comments · 3 min read · LW link

12 interesting things I learned studying the discovery of nature’s laws

Ben Pace · Feb 19, 2022, 11:39 PM
268 points
40 comments · 9 min read · LW link · 1 review

Comment reply: my low-quality thoughts on why CFAR didn’t get farther with a “real/efficacious art of rationality”

AnnaSalamon · Jun 9, 2022, 2:12 AM
261 points
63 comments · 17 min read · LW link · 1 review

Changing the world through slack & hobbies

Steven Byrnes · Jul 21, 2022, 6:11 PM
261 points
13 comments · 10 min read · LW link