Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Julian Stastny
Karma:
174
associate member of technical staff @ redwood research
All
Posts
Comments
New
Top
Old
Misalignment and Strategic Underperformance: An Analysis of Sandbagging and Exploration Hacking
Buck
and
Julian Stastny
May 8, 2025, 7:06 PM
75
points
1
comment
15
min read
LW
link
7+ tractable directions in AI control
Julian Stastny
and
ryan_greenblatt
Apr 28, 2025, 5:12 PM
83
points
1
comment
13
min read
LW
link
Disentangling four motivations for acting in accordance with UDT
Julian Stastny
Nov 5, 2023, 9:26 PM
35
points
3
comments
7
min read
LW
link
Back to top