Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Jacob Pfau
Karma:
544
NYU PhD student working on AI safety
All
Posts
Comments
New
Top
Old
Auditing LMs with counterfactual search: a tool for control and ELK
Jacob Pfau
20 Feb 2024 0:02 UTC
28
points
6
comments
10
min read
LW
link
LM Situational Awareness, Evaluation Proposal: Violating Imitation
Jacob Pfau
26 Apr 2023 22:53 UTC
16
points
2
comments
2
min read
LW
link
Early situational awareness and its implications, a story
Jacob Pfau
6 Feb 2023 20:45 UTC
29
points
6
comments
3
min read
LW
link
Jacob Pfau’s Shortform
Jacob Pfau
17 Jun 2022 16:40 UTC
3
points
19
comments
1
min read
LW
link
Back to top