LLMs as amplifiers, not assistants

Caleb Biddulph19 Jun 2025 17:21 UTC

9 points

0 comments7 min readLW link

How The Singer Sang His Tales

adamShimi19 Jun 2025 17:06 UTC

18 points

0 comments36 min readLW link

(formethods.substack.com)

Key paths, plans and strategies to AI safety success

Adam Jones19 Jun 2025 16:56 UTC

5 points

0 comments6 min readLW link

(bluedot.org)

AI safety techniques leveraging distillation

ryan_greenblatt19 Jun 2025 14:31 UTC

53 points

0 comments12 min readLW link

Political Funding Expertise (Post 6 of 7 on AI Governance)

Mass_Driver19 Jun 2025 14:14 UTC

14 points

0 comments14 min readLW link

Documents Are Dead. Long Live the Conversational Proxy.

8harath19 Jun 2025 14:01 UTC

−8 points

1 comment1 min readLW link

A deep critique of AI 2027’s bad timeline models

titotal19 Jun 2025 13:29 UTC

176 points

2 comments39 min readLW link

(titotal.substack.com)

AI can win a conflict against us

Algon, steven0461 and Vishakha

19 Jun 2025 7:20 UTC

4 points

0 comments2 min readLW link

Different goals may bring AI into conflict with us

Algon, steven0461 and Vishakha

19 Jun 2025 7:19 UTC

5 points

0 comments2 min readLW link

My Failed AI Safety Research Projects (Q1/Q2 2025)

Adam Newgas19 Jun 2025 3:55 UTC

17 points

0 comments3 min readLW link

On May 1, 2033, humanity discovered that AI had been aligned by default.

Yitz18 Jun 2025 19:57 UTC

11 points

2 comments1 min readLW link

New Ethics for the AI Age

Matthieu Tehenan18 Jun 2025 19:30 UTC

1 point

0 comments6 min readLW link

Factored Cognition Strengthens Monitoring and Thwarts Attacks

Aaron Sandoval and Cody Rushing

18 Jun 2025 18:28 UTC

23 points

0 comments25 min readLW link

Sparsely-connected Cross-layer Transcoders

jacob_drori18 Jun 2025 17:13 UTC

41 points

2 comments12 min readLW link

Moral Alignment: An Idea I’m Embarrassed I Didn’t Think of Myself

Gordon Seidoh Worley18 Jun 2025 15:42 UTC

14 points

50 comments2 min readLW link

This was meant for you

Logan Kieller18 Jun 2025 15:26 UTC

3 points

0 comments8 min readLW link

(agenticconjectures.substack.com)

Children of War: Hidden dangers of an AI arms race

Peter Kuhn18 Jun 2025 15:19 UTC

4 points

0 comments7 min readLW link

Fictional Thinking and Real Thinking

johnswentworth17 Jun 2025 19:13 UTC

50 points

8 comments4 min readLW link

The Curious Case of the bos_token

larry-dial17 Jun 2025 19:00 UTC

11 points

1 comment10 min readLW link

Comparing Sparse Autoencoder Features from Individual and Combined Datasets

Greg B17 Jun 2025 18:41 UTC

1 point

0 comments9 min readLW link