Sunishchal Dev

Karma: 29

Improving Model-Written Evals for AI Safety Benchmarking

15 Oct 2024 18:25 UTC
24 points
0 comments · 18 min read · LW link