RSS

Sunishchal Dev

Karma: 31

Im­prov­ing Model-Writ­ten Evals for AI Safety Benchmarking

15 Oct 2024 18:25 UTC
26 points
0 comments18 min readLW link