Zac Hatfield-Dodds comments on AI #34: Chipping Away at Chip Exports

Zac Hatfield-Dodds 19 Oct 2023 20:50 UTC
16 points
12

Adam Jermyn says Anthropic’s RSP includes fine-tuning-included evals every three months or 4x compute increase, including during training.

You don’t need to take anyone’s word for this when checking the primary source is so easy: the RSP is public, and the relevant protocol is on page 12:

In more detail, our evaluation protocol is as follows: … Timing: During model training and fine-tuning, Anthropic will conduct an evaluation of its models for next-ASL capabilities both (1) after every 4x jump in effective compute, including if this occurs mid-training, and (2) every 3 months to monitor fine-tuning/tooling/etc improvements.
- Zvi 20 Oct 2023 13:30 UTC
  2 points
  0
  Parent
  In this case yes, I should have checked the primary source directly, it was worth the effort—I’ve learned to triage such checks but got this one wrong given that I already had the primary source handy.