Learning zero, and what SLT gets wrong about it

Dmitry Vaintrob · 29 Apr 2026 6:41 UTC
11 points
1 comment · 13 min read · LW link

Are LLMs not getting better?

kqr · 29 Apr 2026 6:27 UTC
16 points
2 comments · 2 min read · LW link

llm assistant personas seem increasingly incoherent (some subjective observations)

nostalgebraist · 29 Apr 2026 3:53 UTC
60 points
4 comments · 9 min read · LW link

The AI x-risk lawsuit waiting to happen

David Scott Krueger · 29 Apr 2026 3:50 UTC
13 points
0 comments · 2 min read · LW link
(therealartificialintelligence.substack.com)

Not a Paper: “Frontier Lab CEOs are Capable of In-Context Scheming”

LawrenceC · 29 Apr 2026 3:00 UTC
74 points
2 comments · 7 min read · LW link

Notes on Transformer Consciousness

slavachalnev · 29 Apr 2026 0:00 UTC
23 points
1 comment · 2 min read · LW link

SecureMaxx: A Lightweight Sequence Screening Tool for Agents

Austin Morrissey · 28 Apr 2026 23:47 UTC
3 points
0 comments · 8 min read · LW link

Will whole brain emulation matter for the AI transition?

djbinder · 28 Apr 2026 23:04 UTC
19 points
0 comments · 41 min read · LW link
(defensesindepth.bio)

Causal inference diary: skiing causes snow

Gretta Duleba · 28 Apr 2026 22:21 UTC
15 points
1 comment · 8 min read · LW link

Is AI welfare work puntable?

Oscar · 28 Apr 2026 21:17 UTC
11 points
2 comments · 7 min read · LW link

The Problem in the “Nerd Sniping” xkcd Comic

peralice · 28 Apr 2026 20:40 UTC
50 points
3 comments · 12 min read · LW link

Comment on “Forecasting is Way Overrated, and We Should Stop Funding It”

Josh Rosenberg · 28 Apr 2026 20:16 UTC
22 points
0 comments · 9 min read · LW link

Introspection Adapters: Training LLMs to Report Their Learned Behaviors

28 Apr 2026 19:02 UTC
21 points
0 comments · 12 min read · LW link
(alignment.anthropic.com)

Recursive forecasting: Eliciting long-term forecasts from myopic fitness-seekers

28 Apr 2026 18:00 UTC
55 points
2 comments · 7 min read · LW link

Nobody ever checked

Cameron Berg · 28 Apr 2026 17:15 UTC
24 points
15 comments · 8 min read · LW link
(camberg.substack.com)

Monday AI Radar #23

Against Moloch · 28 Apr 2026 17:12 UTC
4 points
0 comments · 6 min read · LW link
(againstmoloch.com)

An Alignment Journal: Adaptation to AI

28 Apr 2026 17:04 UTC
22 points
1 comment · 13 min read · LW link
(blog.alignmentjournal.org)

Frontier Coding Agents Can Now Implement an AlphaZero Self-Play Machine Learning Pipeline For Connect Four That Performs Comparably to an External Solver

28 Apr 2026 16:50 UTC
25 points
0 comments · 36 min read · LW link

Takes from two months as an aspiring LLM naturalist

AnnaSalamon · 28 Apr 2026 16:14 UTC
95 points
17 comments · 8 min read · LW link

SAEBER: Sparse Autoencoders for Biological Entity Risk

michaelwaves · 28 Apr 2026 14:43 UTC
8 points
0 comments · 8 min read · LW link