capable of RSI in the weak sense of being able to do capabilities research and help plan training runs
The speed at which this kind of thing is possible is crucial, even if capabilities are not above human level. This speed can make planning of training runs less central to the bulk of worthwhile activities. With very high speed, much more theoretical research that doesn’t require waiting for currently plannable training runs becomes useful, as well as things like rewriting all the software, even if models themselves can’t be “manually” retrained as part of this process. Plausibly at some point in the theoretical research you unlock online learning, even the kind that involves gradually shifting to a different architecture, and the inconvenience of distinct training runs disappears.
So this weak RSI would need to involve either AIs that can't do research autonomously but can assist the researchers or engineers, or AIs that are sufficiently slow and non-superintelligent that they can't run through decades of research in months.
This speed can make planning of training runs less central to the bulk of worthwhile activities. With very high speed, much more theoretical research that doesn’t require waiting for currently plannable training runs becomes useful
It doesn’t seem clear to me that this is the case; there isn’t necessarily a faster way to precisely predict the behavior and capabilities of a new model than training it (other than crude measures like ‘loss on next-token prediction continues to decrease as the following function of parameter count’).
It does seem possible and even plausible, but I think our theoretical understanding would have to improve enormously in order to make large advances without empirical testing.
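The "crude measure" mentioned above is a scaling-law prediction. As a sketch, here is the standard power-law form for pretraining loss as a function of parameter count; the constants are illustrative values in the ballpark of published fits, not numbers from any actual training run:

```python
def predicted_loss(n_params: float,
                   e: float = 1.69,     # irreducible loss (illustrative)
                   a: float = 406.4,    # scale coefficient (illustrative)
                   alpha: float = 0.34  # power-law exponent (illustrative)
                   ) -> float:
    """Predicted pretraining loss under L(N) = E + A / N^alpha."""
    return e + a / (n_params ** alpha)

# Scaling up parameters lowers the predicted loss, but the prediction says
# nothing about which specific behaviors or capabilities emerge at that scale,
# which is exactly why it counts as a crude measure.
loss_1b = predicted_loss(1e9)     # ~1B-parameter model
loss_100b = predicted_loss(1e11)  # ~100B-parameter model
assert loss_100b < loss_1b
```

This kind of fit can be extrapolated cheaply before committing to a run, but it predicts only aggregate loss, not the behavior of the resulting model.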
I mean theoretical research on more general topics, not necessarily directly concerned with any given training run, or even with AI. I'm considering the consequences of there being an AI that can do human-level research in math and theoretical CS at much greater speed than humanity. That isn't useful when the AI is slow, because the next training run will render what little progress it makes irrelevant, for the same reason frontier models aren't currently trained for 2 years: a bigger training cluster comes online in 1 and then outruns the older run. But with sufficient speed, catching up on theory from the distant future can become worthwhile.
Oh, I see, I was definitely misreading you; thanks for the clarification!