RSS

Beth Barnes

Karma: 3,027

Alignment researcher. Views are my own and not those of my employer. https://​​www.barnes.page/​​

Bounty: Di­verse hard tasks for LLM agents

Dec 17, 2023, 1:04 AM
49 points
31 comments16 min readLW link

Send us ex­am­ple gnarly bugs

Dec 10, 2023, 5:23 AM
77 points
10 comments2 min readLW link

Manag­ing risks of our own work

Beth BarnesAug 18, 2023, 12:41 AM
66 points
0 comments2 min readLW link

ARC Evals new re­port: Eval­u­at­ing Lan­guage-Model Agents on Real­is­tic Au­tonomous Tasks

Beth BarnesAug 1, 2023, 6:30 PM
153 points
12 comments5 min readLW link
(evals.alignment.org)