You don’t know how bad most things are nor precisely how they’re bad.

Solenoid_Entity · Aug 4, 2024, 2:12 PM
327 points
49 comments · 5 min read · LW link

Would catching your AIs trying to escape convince AI developers to slow down or undeploy?

Buck · Aug 26, 2024, 4:46 PM
314 points
77 comments · 4 min read · LW link

Leaving MIRI, Seeking Funding

abramdemski · Aug 8, 2024, 6:32 PM
264 points
19 comments · 2 min read · LW link

Principles for the AGI Race

William_S · Aug 30, 2024, 2:29 PM
248 points
17 comments · 18 min read · LW link

The ‘strong’ feature hypothesis could be wrong

lewis smith · Aug 2, 2024, 2:33 PM
231 points
19 comments · 17 min read · LW link

AGI Safety and Alignment at Google DeepMind: A Summary of Recent Work

Aug 20, 2024, 4:22 PM
222 points
33 comments · 9 min read · LW link

How I Learned To Stop Trusting Prediction Markets and Love the Arbitrage

orthonormal · Aug 6, 2024, 2:32 AM
198 points
30 comments · 3 min read · LW link

WTH is Cerebrolysin, actually?

Aug 6, 2024, 8:40 PM
175 points
23 comments · 17 min read · LW link

You can remove GPT2’s LayerNorm by fine-tuning for an hour

StefanHex · Aug 8, 2024, 6:33 PM
165 points
11 comments · 8 min read · LW link

[Question] things that confuse me about the current AI market.

DMMF · Aug 28, 2024, 1:46 PM
156 points
27 comments · 2 min read · LW link

Liability regimes for AI

Ege Erdil · Aug 19, 2024, 1:25 AM
153 points
34 comments · 5 min read · LW link

The Information: OpenAI shows ‘Strawberry’ to feds, races to launch it

Martín Soto · Aug 27, 2024, 11:10 PM
145 points
15 comments · 3 min read · LW link

Fields that I reference when thinking about AI takeover prevention

Buck · Aug 13, 2024, 11:08 PM
144 points
16 comments · 10 min read · LW link
(redwoodresearch.substack.com)

Nursing doubts

dynomight · Aug 30, 2024, 2:25 AM
144 points
23 comments · 9 min read · LW link
(dynomight.net)

Limitations on Formal Verification for AI Safety

Andrew Dickson · Aug 19, 2024, 11:03 PM
134 points
60 comments · 23 min read · LW link

Parasites (not a metaphor)

lemonhope · Aug 8, 2024, 8:07 PM
133 points
19 comments · 1 min read · LW link

“Can AI Scaling Continue Through 2030?”, Epoch AI (yes)

gwern · Aug 24, 2024, 1:40 AM
130 points
4 comments · 3 min read · LW link
(epochai.org)

How I started believing religion might actually matter for rationality and moral philosophy

zhukeepa · Aug 23, 2024, 5:40 PM
129 points
41 comments · 7 min read · LW link

Near-mode thinking on AI

Olli Järviniemi · Aug 4, 2024, 8:47 PM
128 points
9 comments · 5 min read · LW link

Investigating the Chart of the Century: Why is food so expensive?

Maxwell Tabarrok · Aug 16, 2024, 1:21 PM
122 points
26 comments · 3 min read · LW link
(www.maximum-progress.com)

Please stop using mediocre AI art in your posts

Raemon · Aug 25, 2024, 12:13 AM
115 points
24 comments · 2 min read · LW link

Ten arguments that AI is an existential risk

Aug 13, 2024, 5:00 PM
113 points
42 comments · 7 min read · LW link
(blog.aiimpacts.org)

Please support this blog (with money)

Elizabeth · Aug 17, 2024, 3:30 PM
112 points
3 comments · 6 min read · LW link
(acesounderglass.com)

A primer on the current state of longevity research

Abhishaike Mahajan · Aug 22, 2024, 5:14 PM
109 points
6 comments · 14 min read · LW link
(www.owlposting.com)

Danger, AI Scientist, Danger

Zvi · Aug 15, 2024, 10:40 PM
107 points
9 comments · 7 min read · LW link
(thezvi.wordpress.com)

Perplexity wins my AI race

Elizabeth · Aug 24, 2024, 7:20 PM
107 points
12 comments · 10 min read · LW link
(acesounderglass.com)

LLM Applications I Want To See

sarahconstantin · Aug 19, 2024, 9:10 PM
102 points
6 comments · 8 min read · LW link
(sarahconstantin.substack.com)

Why you should be using a retinoid

GeneSmith · Aug 19, 2024, 3:07 AM
98 points
60 comments · 5 min read · LW link

the Giga Press was a mistake

bhauth · Aug 21, 2024, 4:51 AM
98 points
26 comments · 5 min read · LW link
(bhauth.com)

It’s time for a self-reproducing machine

Carl Feynman · Aug 7, 2024, 9:52 PM
96 points
69 comments · 9 min read · LW link

[Question] Am I confused about the “malign universal prior” argument?

nostalgebraist · Aug 27, 2024, 11:17 PM
95 points
35 comments · 8 min read · LW link

Dragon Agnosticism

jefftk · Aug 1, 2024, 5:00 PM
94 points
75 comments · 2 min read · LW link
(www.jefftk.com)

Defining alignment research

Richard_Ngo · Aug 19, 2024, 8:42 PM
92 points
23 comments · 7 min read · LW link

SB 1047: Final Takes and Also AB 3211

Zvi · Aug 27, 2024, 10:10 PM
92 points
11 comments · 21 min read · LW link
(thezvi.wordpress.com)

Circular Reasoning

abramdemski · Aug 5, 2024, 6:10 PM
91 points
37 comments · 8 min read · LW link

Singular learning theory: exercises

Zach Furman · Aug 30, 2024, 8:00 PM
90 points
5 comments · 14 min read · LW link

Solving adversarial attacks in computer vision as a baby version of general AI alignment

Stanislav Fort · Aug 29, 2024, 5:17 PM
88 points
8 comments · 7 min read · LW link

What Depression Is Like

Sable · Aug 27, 2024, 5:43 PM
87 points
24 comments · 4 min read · LW link
(affablyevil.substack.com)

Darwinian Traps and Existential Risks

KristianRonn · Aug 25, 2024, 10:37 PM
85 points
14 comments · 10 min read · LW link

If we solve alignment, do we die anyway?

Seth Herd · Aug 23, 2024, 1:13 PM
84 points
129 comments · 4 min read · LW link

Secular interpretations of core perennialist claims

zhukeepa · Aug 25, 2024, 11:41 PM
83 points
32 comments · 14 min read · LW link

Release: Optimal Weave (P1): A Prototype Cohabitive Game

mako yass · Aug 17, 2024, 2:08 PM
82 points
21 comments · 7 min read · LW link

In Defense of Open-Minded UDT

abramdemski · Aug 12, 2024, 6:27 PM
79 points
28 comments · 11 min read · LW link

Quick look: applications of chaos theory

Aug 18, 2024, 3:00 PM
79 points
51 comments · 8 min read · LW link
(acesounderglass.com)

Value fragility and AI takeover

Joe Carlsmith · Aug 5, 2024, 9:28 PM
76 points
5 comments · 30 min read · LW link

Soft Nationalization: how the USG will control AI labs

27 Aug 2024 15:11 UTC
76 points
7 comments · 21 min read · LW link
(www.convergenceanalysis.org)

A Simple Toy Coherence Theorem

2 Aug 2024 17:47 UTC
74 points
22 comments · 7 min read · LW link

AI for Bio: State Of The Field

sarahconstantin · 30 Aug 2024 18:00 UTC
73 points
2 comments · 15 min read · LW link
(sarahconstantin.substack.com)

What is “True Love”?

johnswentworth · 18 Aug 2024 16:05 UTC
72 points
11 comments · 1 min read · LW link

FarmKind’s Illusory Offer

jefftk · 9 Aug 2024 11:30 UTC
71 points
5 comments · 3 min read · LW link
(www.jefftk.com)