Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
New
Hot
Active
Old
Page
1
Can the Safety Tax Be Highly Concentrated?
ozziegooen
15 Jun 2026 18:48 UTC
7
points
0
comments
2
min read
LW
link
A frontier AI company should shut down
MichaelDickens
15 Jun 2026 16:56 UTC
51
points
4
comments
2
min read
LW
link
Links #3: 2026/06 Part 1
papetoast
15 Jun 2026 12:53 UTC
9
points
0
comments
27
min read
LW
link
How reality turns to slop
julius vidal
15 Jun 2026 10:42 UTC
8
points
3
comments
4
min read
LW
link
On Responsibility and Death: Can We See Reality for What It Is or Will It Break Us
Dawn Drescher
15 Jun 2026 10:14 UTC
6
points
0
comments
3
min read
LW
link
(impartial-priorities.org)
VFUSE: Virulent Feature Understanding With Sparse AutoEncoders
michaelwaves
15 Jun 2026 5:06 UTC
11
points
0
comments
2
min read
LW
link
The Power to Punish
Ben Pace
15 Jun 2026 2:22 UTC
19
points
6
comments
5
min read
LW
link
You need to know about the Baruch Plan
aggliu
15 Jun 2026 1:21 UTC
26
points
1
comment
3
min read
LW
link
(signoregalilei.com)
Exploring Known Unknowns in the AI Regulatory Landscape
NelsonDP
14 Jun 2026 22:36 UTC
6
points
0
comments
22
min read
LW
link
(open.substack.com)
Attack of the Killer Differential Equations
Fernand0
14 Jun 2026 22:20 UTC
6
points
0
comments
2
min read
LW
link
I built a public arena where people attack a “pro-human” steering direction
sohampadia10@gmail.com
14 Jun 2026 21:26 UTC
1
point
0
comments
9
min read
LW
link
(sohampadianeu-steering-arena.hf.space)
Why Do Naive SFT Filters For Safety Properties Fail?
Josh Engels
and
Neel Nanda
14 Jun 2026 19:45 UTC
41
points
0
comments
10
min read
LW
link
Why I think a global AI pause (almost) certainly won’t happen
Expertium
14 Jun 2026 19:20 UTC
19
points
0
comments
2
min read
LW
link
Gradual disempowerment at the scale of one user
ppal
14 Jun 2026 18:01 UTC
10
points
0
comments
4
min read
LW
link
How does congressmember use AI?
Ilyass Mofaddel
14 Jun 2026 18:00 UTC
10
points
1
comment
4
min read
LW
link
The Posture of Thought
dongerous
14 Jun 2026 18:00 UTC
12
points
0
comments
5
min read
LW
link
The Dual-Use Gap
Yogesh Prabhu
14 Jun 2026 17:43 UTC
5
points
0
comments
4
min read
LW
link
(yogesh.bearblog.dev)
Can a stronger model fake being a weaker one? Mostly not
Rob Kopel
14 Jun 2026 17:30 UTC
8
points
0
comments
7
min read
LW
link
(www.robkopel.me)
The 1890 Census as a fun cluster
Fernand0
14 Jun 2026 15:41 UTC
0
points
3
comments
1
min read
LW
link
The Hidden Structures of Problems
spencerg
14 Jun 2026 13:51 UTC
79
points
8
comments
3
min read
LW
link
(www.spencergreenberg.com)
Back to top
Next