Failures in Kindness

silentbob26 Mar 2024 21:30 UTC
409 points
60 comments9 min readLW link

On green

Joe Carlsmith21 Mar 2024 17:38 UTC
266 points
35 comments31 min readLW link

My PhD the­sis: Al­gorith­mic Bayesian Epistemology

Eric Neyman16 Mar 2024 22:56 UTC
259 points
14 comments7 min readLW link
(arxiv.org)

My Clients, The Liars

ymeskhout5 Mar 2024 21:06 UTC
248 points
85 comments7 min readLW link

“How could I have thought that faster?”

mesaoptimizer11 Mar 2024 10:56 UTC
222 points
32 comments2 min readLW link
(twitter.com)

Modern Trans­form­ers are AGI, and Hu­man-Level

abramdemski26 Mar 2024 17:46 UTC
219 points
88 comments5 min readLW link

ChatGPT can learn in­di­rect control

Raymond D21 Mar 2024 21:11 UTC
213 points
27 comments1 min readLW link

My In­ter­view With Cade Metz on His Re­port­ing About Slate Star Codex

Zack_M_Davis26 Mar 2024 17:18 UTC
188 points
187 comments6 min readLW link

Daniel Kah­ne­man has died

DanielFilan27 Mar 2024 15:59 UTC
185 points
11 comments1 min readLW link
(www.washingtonpost.com)

Toward a Broader Con­cep­tion of Ad­verse Selection

Ricki Heicklen14 Mar 2024 22:40 UTC
177 points
61 comments13 min readLW link
(bayesshammai.substack.com)

‘Em­piri­cism!’ as Anti-Epistemology

Eliezer Yudkowsky14 Mar 2024 2:02 UTC
171 points
90 comments25 min readLW link

Many ar­gu­ments for AI x-risk are wrong

TurnTrout5 Mar 2024 2:31 UTC
167 points
86 comments12 min readLW link

Us­ing axis lines for good or evil

dynomight6 Mar 2024 14:47 UTC
150 points
39 comments4 min readLW link
(dynomight.net)

Ver­nor Vinge, who coined the term “Tech­nolog­i­cal Sin­gu­lar­ity”, dies at 79

Kaj_Sotala21 Mar 2024 22:14 UTC
149 points
25 comments1 min readLW link
(arstechnica.com)

If you weren’t such an idiot...

2 Mar 2024 0:01 UTC
148 points
74 comments2 min readLW link
(markxu.com)

On Devin

Zvi18 Mar 2024 13:20 UTC
148 points
34 comments11 min readLW link
(thezvi.wordpress.com)

Some (prob­le­matic) aes­thet­ics of what con­sti­tutes good work in academia

Steven Byrnes11 Mar 2024 17:47 UTC
147 points
12 comments12 min readLW link

Read the Roon

Zvi5 Mar 2024 13:50 UTC
136 points
6 comments19 min readLW link
(thezvi.wordpress.com)

The Worst Form Of Govern­ment (Ex­cept For Every­thing Else We’ve Tried)

johnswentworth17 Mar 2024 18:11 UTC
134 points
47 comments4 min readLW link

Com­mu­nity Notes by X

NicholasKees18 Mar 2024 17:13 UTC
124 points
15 comments7 min readLW link

An­thropic re­lease Claude 3, claims >GPT-4 Performance

LawrenceC4 Mar 2024 18:23 UTC
115 points
41 comments2 min readLW link
(www.anthropic.com)

So­cial sta­tus part 1/​2: ne­go­ti­a­tions over ob­ject-level preferences

Steven Byrnes5 Mar 2024 16:29 UTC
115 points
15 comments21 min readLW link

Sim­ple ver­sus Short: Higher-or­der de­gen­er­acy and er­ror-correction

Daniel Murfet11 Mar 2024 7:52 UTC
113 points
6 comments13 min readLW link

The Parable Of The Fallen Pen­du­lum—Part 1

johnswentworth1 Mar 2024 0:25 UTC
111 points
32 comments2 min readLW link

SAE re­con­struc­tion er­rors are (em­piri­cally) pathological

wesg29 Mar 2024 16:37 UTC
105 points
16 comments8 min readLW link

Notes from a Prompt Factory

Richard_Ngo10 Mar 2024 5:13 UTC
101 points
19 comments9 min readLW link
(www.narrativeark.xyz)

LessOn­line (May 31—June 2, Berkeley, CA)

Ben Pace26 Mar 2024 2:34 UTC
100 points
24 comments1 min readLW link
(Less.Online)

Gen­eral Thoughts on Sec­u­lar Solstice

Jeffrey Heninger23 Mar 2024 18:48 UTC
100 points
60 comments8 min readLW link

“Deep Learn­ing” Is Func­tion Approximation

Zack_M_Davis21 Mar 2024 17:50 UTC
98 points
28 comments10 min readLW link
(zackmdavis.net)

On attunement

Joe Carlsmith25 Mar 2024 12:47 UTC
98 points
8 comments22 min readLW link

Notes on Dwarkesh Pa­tel’s Pod­cast with Demis Hassabis

Zvi1 Mar 2024 16:30 UTC
93 points
0 comments8 min readLW link
(thezvi.wordpress.com)

An­nounc­ing Neu­ron­pe­dia: Plat­form for ac­cel­er­at­ing re­search into Sparse Autoencoders

25 Mar 2024 21:17 UTC
92 points
7 comments7 min readLW link

OpenAI: The Board Expands

Zvi12 Mar 2024 14:00 UTC
92 points
1 comment30 min readLW link
(thezvi.wordpress.com)

In­tro­duc­ing METR’s Au­ton­omy Eval­u­a­tion Resources

15 Mar 2024 23:16 UTC
90 points
0 comments1 min readLW link
(metr.github.io)

Stage­wise Devel­op­ment in Neu­ral Networks

20 Mar 2024 19:54 UTC
90 points
1 comment11 min readLW link

New re­port: Safety Cases for AI

joshc20 Mar 2024 16:45 UTC
89 points
14 comments1 min readLW link
(twitter.com)

Nat­u­ral La­tents: The Concepts

20 Mar 2024 18:21 UTC
87 points
18 comments19 min readLW link

Anx­iety vs. Depression

Sable17 Mar 2024 0:15 UTC
85 points
35 comments3 min readLW link
(affablyevil.substack.com)

[Linkpost] Prac­ti­cally-A-Book Re­view: Root­claim $100,000 Lab Leak Debate

trevor28 Mar 2024 16:03 UTC
77 points
22 comments2 min readLW link
(www.astralcodexten.com)

The Cog­ni­tive-The­o­retic Model of the Uni­verse: A Par­tial Sum­mary and Review

jessicata27 Mar 2024 19:59 UTC
77 points
37 comments36 min readLW link
(unstablerontology.substack.com)

The Parable Of The Fallen Pen­du­lum—Part 2

johnswentworth12 Mar 2024 21:41 UTC
77 points
8 comments4 min readLW link

Grief is a fire sale

Nathan Young4 Mar 2024 1:11 UTC
76 points
1 comment4 min readLW link

[Question] What could a policy ban­ning AGI look like?

TsviBT13 Mar 2024 14:19 UTC
76 points
23 comments3 min readLW link

On Claude 3.0

Zvi6 Mar 2024 18:50 UTC
76 points
5 comments31 min readLW link
(thezvi.wordpress.com)

Vote on An­thropic Topics to Discuss

Ben Pace6 Mar 2024 19:43 UTC
75 points
55 comments1 min readLW link

MATS AI Safety Strat­egy Curriculum

7 Mar 2024 19:59 UTC
74 points
2 comments16 min readLW link

Nick Bostrom’s new book, “Deep Utopia”, is out today

PeterH27 Mar 2024 11:24 UTC
73 points
5 comments1 min readLW link
(nickbostrom.com)

“Ar­tifi­cial Gen­eral In­tel­li­gence”: an ex­tremely brief FAQ

Steven Byrnes11 Mar 2024 17:49 UTC
73 points
6 comments2 min readLW link

The World in 2029

Nathan Young2 Mar 2024 18:03 UTC
73 points
37 comments3 min readLW link

Claude 3 claims it’s con­scious, doesn’t want to die or be modified

Mikhail Samin4 Mar 2024 23:05 UTC
72 points
113 comments14 min readLW link