Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
METR (org)
Tag
Last edit:
1 Jul 2024 18:47 UTC
by
Ruby
Formerly ARC Evals
Relevant
New
Old
Review of METR’s public evaluation protocol
nahoj
and
JaimeRV
30 Jun 2024 22:03 UTC
10
points
0
comments
5
min read
LW
link
ARC Evals: Responsible Scaling Policies
Zach Stein-Perlman
28 Sep 2023 4:30 UTC
40
points
9
comments
2
min read
LW
link
(evals.alignment.org)
ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks
Beth Barnes
1 Aug 2023 18:30 UTC
153
points
12
comments
5
min read
LW
link
(evals.alignment.org)
METR is hiring!
Beth Barnes
26 Dec 2023 21:00 UTC
65
points
1
comment
1
min read
LW
link
No comments.
Back to top