RSS

Solv­ing Slop

Noam Makavy18 Mar 2026 5:18 UTC
0 points
0 comments1 min readLW link

Sy­co­phancy Towards Re­searchers Drives Perfor­ma­tive Misalignment

18 Mar 2026 4:59 UTC
15 points
0 comments21 min readLW link

The Psy­chopa­thy Spectrum

Dawn Drescher17 Mar 2026 21:36 UTC
21 points
0 comments1 min readLW link
(impartial-priorities.org)

LLMs as Gi­ant Lookup-Tables of Shal­low Circuits

17 Mar 2026 21:35 UTC
53 points
8 comments7 min readLW link

Re­quiem for a Tran­shu­man Timeline

Ihor Kendiukhov17 Mar 2026 21:27 UTC
53 points
1 comment5 min readLW link

There is No One There: A sim­ple ex­per­i­ment to con­vince your­self that LLMs prob­a­bly are not conscious

Peter Kuhn17 Mar 2026 17:26 UTC
5 points
8 comments5 min readLW link

Re­search note on win­dow shift­ing training

17 Mar 2026 15:58 UTC
18 points
0 comments15 min readLW link

How to not do de­ci­sion the­ory backwards

Anthony DiGiovanni17 Mar 2026 7:22 UTC
14 points
0 comments16 min readLW link

Mon­day AI Radar #17

Against Moloch17 Mar 2026 4:42 UTC
5 points
0 comments6 min readLW link

The bit­ter les­son for software

16 Mar 2026 23:38 UTC
14 points
2 comments2 min readLW link
(fulcruminc.substack.com)

Types of Hand­off to AIs

Daniel Kokotajlo16 Mar 2026 22:24 UTC
54 points
7 comments8 min readLW link

You can’t imi­ta­tion-learn how to con­tinual-learn

Steven Byrnes16 Mar 2026 21:20 UTC
105 points
23 comments6 min readLW link

PSA: Pre­dic­tions mar­kets of­ten have very low liquidity; be care­ful cit­ing them.

Eye You16 Mar 2026 21:07 UTC
105 points
9 comments3 min readLW link

The Plan

Commander Zander16 Mar 2026 20:58 UTC
5 points
0 comments1 min readLW link

What Are My Values?

Corm16 Mar 2026 20:43 UTC
5 points
0 comments8 min readLW link

Three Prop­er­ties for Align­ment (and Why We’re Not Train­ing Them)

Quentin FEUILLADE--MONTIXI16 Mar 2026 20:26 UTC
8 points
4 comments3 min readLW link

Do LLMs Have Stable Prefer­ences?

Robert Gambee16 Mar 2026 20:09 UTC
6 points
0 comments7 min readLW link
(github.com)

The Fermi Para­dox Im­plies Domination

Noam Makavy16 Mar 2026 20:04 UTC
1 point
3 comments2 min readLW link

Ad­ding Ty­pos Made Haiku’s Ac­cu­racy Go Up

bira16 Mar 2026 18:31 UTC
31 points
3 comments3 min readLW link

What are the best ways to pub­lish ra­tio­nal fic­tion nowa­days?

Ihor Kendiukhov16 Mar 2026 18:13 UTC
14 points
11 comments3 min readLW link