Jonathan N

Karma: 3

Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities

5 Nov 2024 1:01 UTC
8 points
0 comments, 6 min read, LW link
(www.apartresearch.com)

ACX Meetup 2022 @ Singapore

Jonathan N, 24 Aug 2022 7:14 UTC
2 points
0 comments, 1 min read, LW link