RSS

--dan­ger­ously-skip-permissions

wingspan12 Jan 2026 7:37 UTC
3 points
0 comments3 min readLW link

Wel­come to the Daily Show! Ex­plain­ing Doom to Lay Folks

Ryan Meservey12 Jan 2026 5:48 UTC
0 points
0 comments6 min readLW link

[Question] What po­tent con­sumer tech­nolo­gies have long re­mained in­ac­cessible?

TsviBT12 Jan 2026 3:13 UTC
18 points
1 comment4 min readLW link

Digi­tal in­ten­tion­al­ity is not about productivity

mingyuan12 Jan 2026 3:09 UTC
20 points
0 comments3 min readLW link
(mingyuan.substack.com)

De pluribus non est disputandum

Jacob Goldsmith12 Jan 2026 0:07 UTC
4 points
0 comments3 min readLW link

5 Con­sid­er­a­tions for Per­sonal Donations

Tristan W11 Jan 2026 17:47 UTC
3 points
1 comment9 min readLW link

We need a bet­ter way to eval­u­ate emer­gent misalignment

11 Jan 2026 16:21 UTC
62 points
1 comment6 min readLW link

Cod­ing Agents As An In­ter­face To The Codebase

omegastick11 Jan 2026 10:31 UTC
15 points
1 comment3 min readLW link
(dumbideas.xyz)

Why AIs aren’t power-seek­ing yet

Eli Tyre11 Jan 2026 7:07 UTC
74 points
7 comments7 min readLW link

The­o­ret­i­cal pre­dic­tions on the sam­ple effi­ciency of train­ing poli­cies and ac­ti­va­tion monitors

Alek Westover10 Jan 2026 23:50 UTC
17 points
2 comments7 min readLW link

If AI al­ign­ment is only as hard as build­ing the steam en­g­ine, then we likely still die

MichaelDickens10 Jan 2026 23:10 UTC
31 points
7 comments4 min readLW link

Pos­si­ble Prin­ci­ples of Superagency

Mariven10 Jan 2026 21:00 UTC
8 points
0 comments12 min readLW link
(mariven.substack.com)

The Case Against Con­tin­u­ous Chain-of-Thought (Neu­ralese)

RobinHa10 Jan 2026 20:32 UTC
7 points
7 comments5 min readLW link

The false con­fi­dence the­o­rem and Bayesian reasoning

viking_math10 Jan 2026 17:14 UTC
22 points
9 comments6 min readLW link

A Pro­posal for a Bet­ter ARENA: Shift­ing from Teach­ing to Re­search Sprints

TheManxLoiner10 Jan 2026 16:56 UTC
24 points
8 comments6 min readLW link

Mo­ral-Epistemic Scrupu­los­ity: A Cross-Frame­work Failure Mode of Truth-Seeking

Tamara Sofía Falcone10 Jan 2026 2:24 UTC
14 points
2 comments8 min readLW link

Find­ing high sig­nal peo­ple—ap­ply­ing PageRank to Twitter

jfguan10 Jan 2026 2:21 UTC
24 points
0 comments3 min readLW link
(thefourierproject.org)

AI In­ci­dent Forecasting

cluebbers10 Jan 2026 2:17 UTC
8 points
0 comments1 min readLW link
(cluebbers.github.io)

6’7” Is Not Random

Martin Lichstam10 Jan 2026 2:13 UTC
−10 points
2 comments2 min readLW link

What do we mean by “im­pos­si­ble”?

Sniffnoy10 Jan 2026 0:01 UTC
23 points
3 comments2 min readLW link