Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Govind Pimpale
Karma:
307
All
Posts
Comments
New
Top
Old
Forecasting Frontier Language Model Agent Capabilities
Govind Pimpale
,
Axel Højmark
,
Jérémy Scheurer
and
Marius Hobbhahn
Feb 24, 2025, 4:51 PM
35
points
0
comments
5
min read
LW
link
(www.apolloresearch.ai)
Do models know when they are being evaluated?
Govind Pimpale
,
Giles
,
Joe Needham
and
Marius Hobbhahn
Feb 17, 2025, 11:13 PM
59
points
3
comments
12
min read
LW
link
Current safety training techniques do not fully transfer to the agent setting
Simon Lermen
and
Govind Pimpale
Nov 3, 2024, 7:24 PM
158
points
9
comments
5
min read
LW
link
~80 Interesting Questions about Foundation Model Agent Safety
RohanS
and
Govind Pimpale
Oct 28, 2024, 4:37 PM
46
points
4
comments
15
min read
LW
link
Analyzing DeepMind’s Probabilistic Methods for Evaluating Agent Capabilities
Axel Højmark
,
Govind Pimpale
,
Arjun Panickssery
,
Marius Hobbhahn
and
Jérémy Scheurer
Jul 22, 2024, 4:17 PM
69
points
0
comments
16
min read
LW
link
Back to top
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel