Adam Jermyn comments on RSPs are pauses done right

Adam Jermyn 14 Oct 2023 12:43 UTC
LW: 7 AF: 4
Anthropic’s RSP includes evals after every 4x increase in effective compute or every 3 months, whichever comes sooner, even if that point falls in the middle of training. The policy also says that these evaluations include fine-tuning.
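As a rough illustration of the "whichever comes sooner" rule as described in this comment (not text from the RSP itself), here is a minimal sketch. The names `LastEval` and `eval_due` are hypothetical, and treating "3 months" as 91 days is an assumption made only to keep the example concrete.

```python
from dataclasses import dataclass
from datetime import date, timedelta

# Thresholds taken from the comment above; the 91-day figure is an
# assumed stand-in for "3 months", not a number from the policy.
COMPUTE_FACTOR = 4.0
MAX_INTERVAL = timedelta(days=91)


@dataclass
class LastEval:
    """Hypothetical record of the most recent evaluation."""
    effective_compute: float  # effective compute at the last eval (arbitrary units)
    when: date                # date of the last eval


def eval_due(last: LastEval, current_compute: float, today: date) -> bool:
    """Return True if either trigger has fired since the last eval."""
    compute_trigger = current_compute >= COMPUTE_FACTOR * last.effective_compute
    time_trigger = (today - last.when) >= MAX_INTERVAL
    return compute_trigger or time_trigger
```

Under these assumptions, a run at 3x the compute of its last eval is not flagged by the compute trigger alone, but it is flagged once the time limit has elapsed since that eval.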
- Hoagy 16 Oct 2023 19:40 UTC
  LW: 1 AF: 1
  Do you know why 4x was picked? I understand that doing evals properly is a pretty substantial effort, but once we get up to gigantic sizes and proto-AGIs, a 4x gap seems like it could hide a lot. If there were a model sitting in training with 3x the train-compute of GPT-4, I’d be very keen to know what it could do!