AI Control
Tag
Last edit: 9 Mar 2024 0:47 UTC by Lauro Langosco
Critiques of the AI control agenda
Jozdien, 14 Feb 2024 19:25 UTC, 47 points, 14 comments, 9 min read
Protocol evaluations: good analogies vs control
Fabien Roger, 19 Feb 2024 18:00 UTC, 35 points, 10 comments, 11 min read
How useful is “AI Control” as a framing on AI X-Risk?
habryka and ryan_greenblatt, 14 Mar 2024 18:06 UTC, 67 points, 4 comments, 34 min read
AXRP Episode 27 - AI Control with Buck Shlegeris and Ryan Greenblatt
DanielFilan, 11 Apr 2024 21:30 UTC, 69 points, 10 comments, 107 min read
Auditing LMs with counterfactual search: a tool for control and ELK
Jacob Pfau, 20 Feb 2024 0:02 UTC, 28 points, 6 comments, 10 min read
How to safely use an optimizer
Simon Fischer, 28 Mar 2024 16:11 UTC, 47 points, 21 comments, 7 min read