Connor Axiotes
Karma: 2
Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities
Jonathan N, abra, Connor Axiotes and Esben Kran
5 Nov 2024 1:01 UTC, 8 points, 0 comments, 6 min read, LW link (www.apartresearch.com)