Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
New
Hot
Active
Old
Page
1
If It’s Worth Arguing, It’s Worth Arguing With Whiteboards
Drake Morrison
18 Apr 2026 5:56 UTC
5
points
0
comments
2
min read
LW
link
Refactor Arena: A Control Setting for Software Engineering
fastfedora
and
Tyler Tracy
18 Apr 2026 2:57 UTC
6
points
0
comments
25
min read
LW
link
Idea Economics
David Scott Krueger (formerly: capybaralet)
18 Apr 2026 0:20 UTC
16
points
0
comments
4
min read
LW
link
(therealartificialintelligence.substack.com)
AI for decision advice
Tom Davidson
17 Apr 2026 21:40 UTC
14
points
0
comments
1
min read
LW
link
(www.forethought.org)
Variations On Tree Reconstruction
adamShimi
17 Apr 2026 20:50 UTC
11
points
1
comment
6
min read
LW
link
(formethods.substack.com)
3 years of being on birth control
AnnaJo
17 Apr 2026 20:36 UTC
61
points
0
comments
4
min read
LW
link
Prompted CoT Early Exit Undermines the Monitoring Benefits of CoT Uncontrollability
Elle Najt
,
Asa Cooper Stickland
and
Xander Davies
17 Apr 2026 19:30 UTC
53
points
0
comments
15
min read
LW
link
Morality without Consciousness
IanWS
17 Apr 2026 18:34 UTC
2
points
0
comments
11
min read
LW
link
AI self-preservation is probably due to instruction ambiguity
Maximus Ren
17 Apr 2026 18:30 UTC
1
point
0
comments
2
min read
LW
link
Arguments Should Be Decisive Criticisms
Elliot Temple
17 Apr 2026 17:50 UTC
3
points
1
comment
7
min read
LW
link
Humane Pesticides Are Massively Morally Urgent
Bentham's Bulldog
17 Apr 2026 15:29 UTC
5
points
0
comments
4
min read
LW
link
“Best humans still outperform”: One turning point in the history of cope around artificial intelligence
Oliver Sourbut
17 Apr 2026 14:10 UTC
28
points
6
comments
3
min read
LW
link
(www.oliversourbut.net)
Society is a social construct, pace Arrow
jchan
17 Apr 2026 14:00 UTC
9
points
4
comments
3
min read
LW
link
Consent-Based RL: Letting Models Endorse Their Own Training Updates
Logan Riggs
17 Apr 2026 13:53 UTC
49
points
3
comments
3
min read
LW
link
What does status signalling do? When successful, what does it achieve?
SpectrumDT
17 Apr 2026 9:34 UTC
9
points
6
comments
2
min read
LW
link
The map is part of the territory
yatharth
17 Apr 2026 7:50 UTC
4
points
1
comment
2
min read
LW
link
Publish-first writing
yatharth
17 Apr 2026 7:20 UTC
3
points
0
comments
2
min read
LW
link
Let goodness conquer all that it can defend
habryka
17 Apr 2026 6:55 UTC
135
points
99
comments
7
min read
LW
link
Why I’m Less of a Shill for Related Work Sections
LawrenceC
17 Apr 2026 6:49 UTC
20
points
0
comments
3
min read
LW
link
From Artificial Intelligence to an ecosystem of artificial life-forms.
David Scott Krueger (formerly: capybaralet)
17 Apr 2026 6:30 UTC
11
points
1
comment
2
min read
LW
link
(therealartificialintelligence.substack.com)
Back to top
Next