Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Govind Pimpale
Karma:
238
All
Posts
Comments
New
Top
Old
Current safety training techniques do not fully transfer to the agent setting
Simon Lermen
and
Govind Pimpale
3 Nov 2024 19:24 UTC
156
points
8
comments
5
min read
LW
link
~80 Interesting Questions about Foundation Model Agent Safety
RohanS
and
Govind Pimpale
28 Oct 2024 16:37 UTC
45
points
4
comments
15
min read
LW
link
Analyzing DeepMind’s Probabilistic Methods for Evaluating Agent Capabilities
Axel Højmark
,
Govind Pimpale
,
Arjun Panickssery
,
Marius Hobbhahn
and
Jérémy Scheurer
22 Jul 2024 16:17 UTC
69
points
0
comments
16
min read
LW
link
Back to top