Zvi comments on RSPs are pauses done right

Zvi 14 Oct 2023 12:02 UTC
LW: 4 AF: 3
Is it realistic to properly evaluate capabilities continuously during model training, given that (as you note) doing so requires fine-tuning and other such techniques, without that being prohibitively slow or expensive? Would doing this be part of the intended RSP?
- Adam Jermyn 14 Oct 2023 12:43 UTC
  LW: 7 AF: 4
  Anthropic’s RSP includes evals after every 4x increase in effective compute or every 3 months, whichever comes sooner, even if this happens during training, and the policy says that these evaluations include fine-tuning.
  - Hoagy 16 Oct 2023 19:40 UTC
    LW: 1 AF: 1
    Do you know why 4x was picked? I understand that doing evals properly is a pretty substantial effort, but once we get up to gigantic sizes and proto-AGIs it seems like a 4x gap could hide a lot. If there were a model sitting in training with 3x the training compute of GPT-4, I’d be very keen to know what it could do!
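
The trigger rule described in Adam Jermyn's comment above (evals after every 4x increase in effective compute or every 3 months, whichever comes sooner) can be sketched as a small check. This is only an illustration of the rule as stated in the thread, not Anthropic's actual tooling: the class name `EvalTrigger`, its fields, the use of 90 days as a stand-in for "3 months", and the unitless compute values are all assumptions.

```python
from dataclasses import dataclass

# Thresholds as stated in the thread; everything else here is hypothetical.
COMPUTE_FACTOR = 4.0  # evaluate after every 4x increase in effective compute
MAX_DAYS = 90.0       # assumed approximation of "every 3 months"


@dataclass
class EvalTrigger:
    """Tracks when the last evaluation happened (hypothetical helper)."""

    last_eval_compute: float  # effective compute at the last evaluation
    last_eval_day: float      # day (since training start) of the last evaluation

    def due(self, compute: float, day: float) -> bool:
        """Return True if either condition has been met, whichever comes first."""
        compute_hit = compute >= COMPUTE_FACTOR * self.last_eval_compute
        time_hit = (day - self.last_eval_day) >= MAX_DAYS
        return compute_hit or time_hit

    def record(self, compute: float, day: float) -> None:
        """Reset both baselines after an evaluation has been run."""
        self.last_eval_compute = compute
        self.last_eval_day = day


trigger = EvalTrigger(last_eval_compute=1.0, last_eval_day=0.0)
at_3x_day_30 = trigger.due(compute=3.0, day=30.0)  # neither threshold reached
at_4x_day_30 = trigger.due(compute=4.0, day=30.0)  # compute threshold reached
at_3x_day_90 = trigger.due(compute=3.0, day=90.0)  # time threshold reached
```

Under these assumptions, the 3x case Hoagy raises would not trigger an evaluation on compute alone; it would be picked up only once the time limit is reached.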