Maxime Riché comments on We need a Science of Evals

Maxime Riché 23 Jan 2024 11:57 UTC
LW: 9 AF: 3
0
AF
FYI, the “Evaluating Alignment Evaluations” project of the current AI Safety Camp is working on studying and characterizing alignment(propensity) evaluations. We hope to contribute to the science of evals, and we will contact you next month. (Somewhat deprecated project proposal)
- Marius Hobbhahn 23 Jan 2024 14:49 UTC
  LW: 3 AF: 1
  0
  AF Parent
  Nice work. Looking forward to that!