RSS

What Differ­en­ti­ates Hu­mans from Computers

Oscar Davies16 Jun 2026 21:26 UTC
−9 points
0 comments3 min readLW link

Two Clas­si­cal An­swers to “What do Two Vari­ables Share?”

Haru16 Jun 2026 20:02 UTC
8 points
0 comments5 min readLW link

Pre­dict­ing LLM Safety Be­fore Re­lease by Si­mu­lat­ing Deployment

16 Jun 2026 19:55 UTC
8 points
0 comments1 min readLW link

Tips for Crack­ing the AI Safety Tech­ni­cal Interview

16 Jun 2026 18:42 UTC
1 point
0 comments4 min readLW link

1 Layer In­duc­tion Heads and Some Research

16 Jun 2026 18:09 UTC
10 points
0 comments14 min readLW link

Claims all the way down

Jasper Blank16 Jun 2026 17:43 UTC
6 points
0 comments9 min readLW link

Ex­treme Ra­tion­al­ity: Still Not That Great

игорь тимофеев16 Jun 2026 16:41 UTC
14 points
1 comment40 min readLW link

An­gles of at­tack for con­tinual learn­ing safety

16 Jun 2026 16:15 UTC
32 points
0 comments13 min readLW link

Fable and Mythos: Model Welfare

Zvi16 Jun 2026 16:01 UTC
37 points
1 comment15 min readLW link
(thezvi.wordpress.com)

The de­sire to end the world

avturchin16 Jun 2026 14:56 UTC
12 points
11 comments2 min readLW link

Sim­pler User In­ter­faces in an AI Future

Adam Chlipala16 Jun 2026 14:48 UTC
1 point
0 comments7 min readLW link

A 400-year timeline of failed at­tempts to fix a lethal bug in the hu­man soft­ware of in­her­ited concepts

Bruce Middleton16 Jun 2026 13:44 UTC
21 points
3 comments5 min readLW link

How the AI Village works

Adam B16 Jun 2026 12:10 UTC
23 points
0 comments8 min readLW link
(theaidigest.org)

Ra­tion­al­ity Quotes, June ’26

Ben Pace16 Jun 2026 3:44 UTC
19 points
3 comments2 min readLW link

A Test Suite for Concepts

Gretta Duleba16 Jun 2026 2:41 UTC
45 points
7 comments6 min readLW link

In­vent­ing Consciousness

vasilisk16 Jun 2026 1:10 UTC
8 points
0 comments5 min readLW link

Syn­thetic doc­u­ment fine­tun­ing for in­still­ing pos­i­tive traits

16 Jun 2026 0:04 UTC
39 points
1 comment10 min readLW link

Does preser­va­tion make sense be­fore we know how to re­vive?

Aurelia15 Jun 2026 23:40 UTC
70 points
0 comments25 min readLW link

Find­ing pi and G in Mathland

Fernand015 Jun 2026 19:18 UTC
0 points
8 comments2 min readLW link

How Ma­tryoshka Sparse Au­toEn­coders Re­cover Fea­ture Hier­ar­chies That Vanilla SAEs Lose

baimamboukar15 Jun 2026 18:50 UTC
11 points
1 comment6 min readLW link