Overall, this has updated me to some extent: it now seems less plausible to me that ML research can achieve 30x speedups while human researchers do all of the high-level ideas. (I think the picture looks importantly better when AIs are removing this bottleneck.)
I think the most plausible case for your position would be that the compute costs for ML research scale much less than quadratically
The situation I was imagining is where most experiments use some combination of:
A relatively small amount of finetuning/inference on the biggest models (including for the actual AI researcher)
Larger (possibly full) training runs, but at much smaller scale (e.g. GPT-3 level performance models)
Then, in total we can afford roughly training-dataset-sized amounts of finetuning/inference for the biggest models (by the inference-availability argument), and GPT-3-performance experiments will be pretty cheap. So, assuming our base model looks like GPT-6 with the expected compute requirement and model size, that is a huge amount of available inference.
So the implicit claim is that compute costs scale much less than quadratically. It's certainly not obvious that ML research can progress fast enough with this little compute.
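To make the inference-availability arithmetic concrete, here is a minimal back-of-envelope sketch. It assumes the standard approximations of ~6·N·D FLOP for training and ~2·N FLOP per inference token (N = parameters, D = training tokens); the specific parameter and token counts are illustrative placeholders, not claims about actual GPT-6- or GPT-3-scale models.

```python
# Back-of-envelope sketch of the "inference availability" argument.
# Assumes training compute ~= 6 * N * D FLOP and inference compute
# ~= 2 * N FLOP per token (N = parameters, D = training tokens).
# All model sizes below are illustrative placeholders.

def training_flop(n_params: float, n_tokens: float) -> float:
    return 6 * n_params * n_tokens

def inference_flop(n_params: float, n_tokens: float) -> float:
    return 2 * n_params * n_tokens

# Hypothetical frontier ("GPT-6"-ish) model.
N_frontier, D_frontier = 1e13, 2e14  # placeholders

train_cost = training_flop(N_frontier, D_frontier)

# Inference over a training-dataset-sized number of tokens costs about
# a third of the training run itself.
print(inference_flop(N_frontier, D_frontier) / train_cost)  # ~0.33

# A much smaller ("GPT-3-performance") experiment model for comparison:
# full training runs at this scale are tiny relative to the frontier run.
N_small, D_small = 1e10, 2e11  # placeholders
print(training_flop(N_small, D_small) / train_cost)  # ~1e-6
```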
Another possibility is that human ML researchers get really good at multi-tasking
I was certainly imagining at least some amount of multi-tasking (e.g. 4 projects at once, each of which runs 8x faster). This doesn't feel that crazy to me; I already do a moderate amount of multi-tasking.
Currently it seems feasible to make a paper’s worth of progress in ~6 months.
Note that this often involves multiple people working on the same paper. In the AI case, the division of labor might look at least somewhat different. (Though I don’t think this changes the picture very much from what you’re describing because most people now aren’t the “ideas” people.)
Cool, that all roughly makes sense to me :)
I was certainly imagining at least some amount of multi-tasking (e.g. 4 projects at once, each of which runs 8x faster). This doesn't feel that crazy to me; I already do a moderate amount of multi-tasking.
Multi-tasking where you are responsible for the entire design of the project? (Designing the algorithm, choosing an experimental setting and associated metrics, knowing the related work, interpreting the results of the experiments, figuring out what the next experiment should be, …)
Suppose today I gave you a device where you put in moderately detailed instructions for experiments, and the device returns the results[1] with N minutes of latency and infinite throughput. Do you think you can spend 1 working day using this device to produce the same output as 4 copies of yourself working in parallel for a week (and continue to do that for months, after you’ve exhausted low-hanging fruit)?
… Having written this hypothetical out, I am finding it more plausible than before, at least for small enough N, though it still feels quite hard at e.g. N = 60.
[1] The experiments can't use too much compute. No solving the halting problem.
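For a rough sense of what this hypothetical demands, here is a quick sketch of the arithmetic, assuming a 5-day work week and an 8-hour working day (assumptions not specified in the discussion above).

```python
# Rough numbers for the device thought experiment above.
# Assumes a 5-day work week and an 8-hour working day (assumptions).

WORK_WEEK_DAYS = 5
WORK_DAY_MINUTES = 8 * 60

# Matching 4 parallel copies working for a week within 1 working day
# amounts to roughly a 20x productivity multiplier.
print(4 * WORK_WEEK_DAYS)  # 20

# Upper bound on fully serial experiment -> result -> next-experiment
# cycles per working day, ignoring time spent thinking and writing
# instructions, for a few values of the latency N.
for latency_minutes in (5, 15, 60, 24 * 60):
    print(latency_minutes, WORK_DAY_MINUTES // latency_minutes)
# N = 5     -> 96 serial cycles/day
# N = 15    -> 32
# N = 60    -> 8
# N = 1 day -> 0 complete cycles within a single working day
```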
Probably yes for realistic values of N?
Assuming the box is pretty smart at understanding instructions (and has an understanding of my typical ontology to the extent you would get after working with me for a few weeks and reading various posts), and the box will ask follow-up questions in cases where the instructions are unclear. (And we can do small diffs with reduced latency, like asking for the results to be plotted in a different way.)
My main concern is running out of ideas after a while, whereas copies of myself with more thinking time would have more time to generate ideas.
Sounds reasonable, though idk what you think realistic values of N are (my wild guess with hardly any thought is 15 minutes to 1 day).
EDIT: To be clear, in the 1-day case I'm imagining that most of the time goes towards running the experiment; it's more a claim about what experiments we want to run. If we just talk about the time to write the code and launch the experiment, I'm thinking of N in the range of 5 minutes to 1 hour.