Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
New
Hot
Active
Old
Page
1
Not telling is lying
Fernand0
13 Jun 2026 18:12 UTC
11
points
1
comment
3
min read
LW
link
A simple argument for trying less hard
Elias Schmied
13 Jun 2026 18:12 UTC
5
points
0
comments
3
min read
LW
link
How might continual learning affect safety and alignment?
Rauno Arike
,
RohanS
,
Owen Terry
,
Achu Menon
,
Zhijing Jin
,
Francis Rhys Ward
and
Seth Herd
13 Jun 2026 17:34 UTC
27
points
0
comments
16
min read
LW
link
Presentfulness: Lucidity, Osmosis, and Dissociation
Astrid Callender
13 Jun 2026 17:21 UTC
4
points
0
comments
5
min read
LW
link
How to Suffer Less
Gordon Seidoh Worley
13 Jun 2026 17:10 UTC
16
points
0
comments
6
min read
LW
link
(www.uncertainupdates.com)
Somewhat Contra Ted Chiang on AI Consciousness
ThomasJ
13 Jun 2026 16:49 UTC
5
points
0
comments
10
min read
LW
link
The term “AGI” is almost useless at this point [Linkpost]
Noosphere89
13 Jun 2026 16:15 UTC
37
points
1
comment
4
min read
LW
link
(helentoner.substack.com)
AML for AI as a verification mechanism
MarkelKori
13 Jun 2026 11:59 UTC
9
points
2
comments
2
min read
LW
link
Pulling hedonic utilitarianism out of ethical emotivism
Bill Jackson
13 Jun 2026 11:50 UTC
6
points
0
comments
6
min read
LW
link
(billjackson7.substack.com)
Tequila Sunset at the Hog’s Head (A Scene)
Ben Pace
13 Jun 2026 6:53 UTC
12
points
0
comments
5
min read
LW
link
Sandy Blvd as an example of complexity
Adam Zerner
13 Jun 2026 0:28 UTC
9
points
0
comments
2
min read
LW
link
What’s Continual Learning, and Why Might We Expect To See It In Advanced LLM Agents?
RohanS
,
Rauno Arike
,
Owen Terry
,
Achu Menon
,
Zhijing Jin
,
Francis Rhys Ward
and
Seth Herd
12 Jun 2026 18:43 UTC
24
points
2
comments
17
min read
LW
link
Implications of Continual Learning for LLM Agents: Introduction
RohanS
,
Rauno Arike
,
Owen Terry
,
Achu Menon
,
Zhijing Jin
,
Francis Rhys Ward
and
Seth Herd
12 Jun 2026 18:36 UTC
43
points
0
comments
6
min read
LW
link
Reward Hacking at the 1937 World’s Fair
frmsaul
12 Jun 2026 17:47 UTC
35
points
5
comments
3
min read
LW
link
Bunk in AF
Fernand0
12 Jun 2026 17:41 UTC
6
points
0
comments
1
min read
LW
link
Building and evaluating model diffing agents
bilalchughtai
,
Josh Engels
and
Neel Nanda
12 Jun 2026 17:14 UTC
54
points
2
comments
12
min read
LW
link
“AF needs empirical grounding” is a meaningless valley of compromise
Fernand0
12 Jun 2026 16:37 UTC
8
points
2
comments
1
min read
LW
link
How bad would it be if GPS satellites were shot down?
Jackson Wagner
12 Jun 2026 16:34 UTC
17
points
0
comments
21
min read
LW
link
Sympathy for both sides of the egregious misalignment debate
Steven Byrnes
12 Jun 2026 16:26 UTC
160
points
15
comments
4
min read
LW
link
The Uncertainty That Matters Isn’t Fundamental
jimmy
12 Jun 2026 16:23 UTC
28
points
1
comment
13
min read
LW
link
Back to top
Next