RSS

Can the Safety Tax Be Highly Con­cen­trated?

ozziegooen15 Jun 2026 18:48 UTC
7 points
0 comments2 min readLW link

A fron­tier AI com­pany should shut down

MichaelDickens15 Jun 2026 16:56 UTC
51 points
4 comments2 min readLW link

Links #3: 2026/​06 Part 1

papetoast15 Jun 2026 12:53 UTC
9 points
0 comments27 min readLW link

How re­al­ity turns to slop

julius vidal15 Jun 2026 10:42 UTC
8 points
3 comments4 min readLW link

On Re­spon­si­bil­ity and Death: Can We See Real­ity for What It Is or Will It Break Us

Dawn Drescher15 Jun 2026 10:14 UTC
6 points
0 comments3 min readLW link
(impartial-priorities.org)

VFUSE: Viru­lent Fea­ture Un­der­stand­ing With Sparse AutoEncoders

michaelwaves15 Jun 2026 5:06 UTC
11 points
0 comments2 min readLW link

The Power to Punish

Ben Pace15 Jun 2026 2:22 UTC
19 points
6 comments5 min readLW link

You need to know about the Baruch Plan

aggliu15 Jun 2026 1:21 UTC
26 points
1 comment3 min readLW link
(signoregalilei.com)

Ex­plor­ing Known Un­knowns in the AI Reg­u­la­tory Landscape

NelsonDP14 Jun 2026 22:36 UTC
6 points
0 comments22 min readLW link
(open.substack.com)

At­tack of the Killer Differ­en­tial Equations

Fernand014 Jun 2026 22:20 UTC
6 points
0 comments2 min readLW link

I built a pub­lic arena where peo­ple at­tack a “pro-hu­man” steer­ing direction

sohampadia10@gmail.com14 Jun 2026 21:26 UTC
1 point
0 comments9 min readLW link
(sohampadianeu-steering-arena.hf.space)

Why Do Naive SFT Filters For Safety Prop­er­ties Fail?

14 Jun 2026 19:45 UTC
41 points
0 comments10 min readLW link

Why I think a global AI pause (al­most) cer­tainly won’t happen

Expertium14 Jun 2026 19:20 UTC
19 points
0 comments2 min readLW link

Grad­ual dis­em­pow­er­ment at the scale of one user

ppal14 Jun 2026 18:01 UTC
10 points
0 comments4 min readLW link

How does con­gress­mem­ber use AI?

Ilyass Mofaddel14 Jun 2026 18:00 UTC
10 points
1 comment4 min readLW link

The Pos­ture of Thought

dongerous14 Jun 2026 18:00 UTC
12 points
0 comments5 min readLW link

The Dual-Use Gap

Yogesh Prabhu14 Jun 2026 17:43 UTC
5 points
0 comments4 min readLW link
(yogesh.bearblog.dev)

Can a stronger model fake be­ing a weaker one? Mostly not

Rob Kopel14 Jun 2026 17:30 UTC
8 points
0 comments7 min readLW link
(www.robkopel.me)

The 1890 Cen­sus as a fun cluster

Fernand014 Jun 2026 15:41 UTC
0 points
3 comments1 min readLW link

The Hid­den Struc­tures of Problems

spencerg14 Jun 2026 13:51 UTC
79 points
8 comments3 min readLW link
(www.spencergreenberg.com)