RSS

The Day After Move 37

Eneasz10 Mar 2026 23:05 UTC
10 points
0 comments6 min readLW link
(deathisbad.substack.com)

In­ter­view with Steven Byrnes on His Main­line Take­off Scenario

Liron10 Mar 2026 20:17 UTC
30 points
0 comments57 min readLW link
(doomdebates.com)

Au­ditBench: Eval­u­at­ing Align­ment Au­dit­ing Tech­niques on Models with Hid­den Behaviors

abhayesian10 Mar 2026 19:31 UTC
21 points
1 comment8 min readLW link
(alignment.anthropic.com)

Eco­nomic effi­ciency of­ten un­der­mines so­ciopoli­ti­cal autonomy

Richard_Ngo10 Mar 2026 19:30 UTC
65 points
6 comments12 min readLW link
(www.mindthefuture.info)

Let­ting Claude do Au­tonomous Re­search to Im­prove SAEs

chanind10 Mar 2026 18:52 UTC
48 points
2 comments7 min readLW link

Don’t Let LLMs Write For You

JustisMills10 Mar 2026 18:49 UTC
58 points
1 comment3 min readLW link
(justismills.substack.com)

Ques­tions to ask when ev­ery­one is shoot­ing them­selves in the foot

jasoncrawford10 Mar 2026 18:36 UTC
20 points
1 comment1 min readLW link

The case for sa­ti­at­ing cheaply-satis­fied AI preferences

Alex Mallen10 Mar 2026 18:09 UTC
60 points
3 comments23 min readLW link

Gemma Needs Help

Anna Soligo10 Mar 2026 17:39 UTC
108 points
8 comments6 min readLW link

Not Lov­ing Lik­ing What You See

Tomás B.10 Mar 2026 16:05 UTC
35 points
5 comments5 min readLW link

Statis­ti­cism: How Cluster-Think­ing About Data Creates Blind Spots

Benquo10 Mar 2026 13:59 UTC
22 points
0 comments12 min readLW link
(benjaminrosshoffman.com)

Spon­ta­neous Sym­me­try Break­ing (Stat Mech Part 4)

J Bostock10 Mar 2026 13:21 UTC
13 points
2 comments4 min readLW link

Why I don’t usu­ally recom­mend dead drops

samuelshadrach10 Mar 2026 13:13 UTC
7 points
2 comments4 min readLW link
(samuelshadrach.com)

Four Sce­nar­ios of Job-Re­duc­ing AI

Celer10 Mar 2026 13:10 UTC
11 points
2 comments4 min readLW link
(keller.substack.com)

Un­der­stand­ing Rea­son­ing with Thought An­chors and Probes

10 Mar 2026 11:50 UTC
7 points
0 comments13 min readLW link

Con­tra My­self on Free Will

Julius10 Mar 2026 6:29 UTC
9 points
5 comments10 min readLW link
(thegreymatter.substack.com)

The case for AI safety ca­pac­ity-build­ing work

abergal10 Mar 2026 2:43 UTC
44 points
14 comments22 min readLW link

Chore Standards

jefftk10 Mar 2026 2:30 UTC
27 points
1 comment2 min readLW link
(www.jefftk.com)

An­cient The­o­ries On The Ori­gins Of Life

Algon10 Mar 2026 1:55 UTC
21 points
0 comments3 min readLW link
(algon33.substack.com)

Im­mor­tal­ity: A Begin­ner’s Guide (Part 2)

MarkelKori10 Mar 2026 0:11 UTC
9 points
4 comments5 min readLW link