Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Oliver Daniels
Karma:
108
PhD Student at Umass Amherst
All
Posts
Comments
New
Top
Old
Concrete Methods for Heuristic Estimation on Neural Networks
Oliver Daniels
14 Nov 2024 5:07 UTC
25
points
0
comments
27
min read
LW
link
Concrete empirical research projects in mechanistic anomaly detection
Erik Jenner
,
Viktor Rehnberg
and
Oliver Daniels
3 Apr 2024 23:07 UTC
43
points
3
comments
10
min read
LW
link
Oliver Daniels-Koch’s Shortform
Oliver Daniels
17 Mar 2024 17:24 UTC
2
points
12
comments
1
min read
LW
link
[Question]
Experiments to Test the Probability of Strategic Deceptive Misalignment?
Oliver Daniels
18 Jan 2024 0:13 UTC
2
points
0
comments
1
min read
LW
link
Back to top