It may also need a structured training environment and heuristic to select for generality.
The structured training environment is a set of tasks that teaches the machine a broad base of knowledge and skills, enough to be a general AI.
The heuristic is just the point system: what metric are we selecting AI candidates by? Presumably we want a metric that selects simpler, smaller candidates with architectures that are heavily reused (something that looks like the topology of a brain), but maybe that won't work.
So the explosion takes several things: compute, recursion, a software stack that is composable enough for automated design iteration, a benchmark, and a heuristic.
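To make the "heuristic" part concrete, here's a toy sketch of what I mean by a point system. This is purely my own illustration, not any real system: the fields, weights, and numbers are all made up, but it shows the idea of rewarding benchmark performance while penalizing size and architectural sprawl.

```python
# Hypothetical scoring heuristic for AI candidates (illustrative sketch only).
# Rewards benchmark performance, penalizes parameter count, and rewards
# heavy module reuse (fewer distinct module types).
from dataclasses import dataclass

@dataclass
class Candidate:
    bench_score: float      # fraction of training-environment tasks passed, 0..1
    param_count: int        # total parameters
    unique_modules: int     # distinct module types; fewer means more reuse

def heuristic(c: Candidate, size_penalty=1e-10, reuse_penalty=0.01) -> float:
    return (c.bench_score
            - size_penalty * c.param_count
            - reuse_penalty * c.unique_modules)

candidates = [
    Candidate(bench_score=0.72, param_count=3_000_000_000, unique_modules=40),
    Candidate(bench_score=0.70, param_count=800_000_000, unique_modules=6),
]
best = max(candidates, key=heuristic)   # the smaller, heavily reused design wins here
```

Whether a metric like this actually selects for generality, rather than for candidates that game the benchmark, is exactly the open question.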
Nukes weren't really simple either; there were a lot of steps, especially for the first implosion device. It took an immense amount of money and resources from the moment physicists realized it was possible.
I think people are ignoring criticality because it hasn't shown any gains in the history of AI; past systems were too simple for it. It's not a proven track to success. What does work is bigass transformers.
I suppose I expect recursive self-improvement to play out over the course of months, not years. And I worry groups like OpenAI are insane enough to pursue recursive self-improvement as an explicit engineering goal. (Altman seems to be a moral realist; he explicitly says he thinks the orthogonality thesis is false.) From the outside it will appear instant, because there will be a perceived discontinuity the moment it becomes obvious that the system has achieved a decisive strategic advantage.
Well, again, remember that a nuclear device is a critical mass of weapons-grade material.
Anything less than weapons grade and nothing happens.
Anything less than a sudden, explosive assembly of the material and the device heats itself up and blasts itself apart with a sub-kiloton yield.
So analogy-wise: current LLMs can "babble" out code that sometimes even works, but they are not trained with RL that selects for correct, functional code.
Self-improvement by code generation isn't yet possible.
Other groups have tried making neural networks composable and using one neural-network-based agent to design others. That is how AutoML works, but it is also not good enough for recursion.
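The outer loop in these systems is roughly the same shape everywhere. Here's a bare-bones sketch of that shape (the `proposer`, `evaluate`, and `is_functional` functions are hypothetical placeholders I'm passing in, not anything from a real AutoML library):

```python
# Sketch of an AutoML-style search loop. A proposer emits candidate designs;
# only candidates that actually run get scored; the best survivor seeds the
# next round. If no candidate is functional, the loop stalls out immediately.
def search(proposer, evaluate, is_functional, rounds=10, population=32):
    best, best_score = None, float("-inf")
    for _ in range(rounds):
        # proposer(None) on the first round means "propose from scratch"
        candidates = [proposer(best) for _ in range(population)]
        viable = [c for c in candidates if is_functional(c)]
        if not viable:
            break                      # first generation dies: recursion never starts
        for c in viable:
            score = evaluate(c)        # the heuristic / point system
            if score > best_score:
                best, best_score = c, score
    return best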
Basically, our enrichment isn't high enough, so nothing will happen. The recursion quenches itself before it can start; the first-generation output isn't even functional.
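The criticality analogy really is just geometric growth. If each generation of the loop multiplies capability by some factor k, then k below 1 fizzles and k above 1 runs away; the only subtlety is what sets k. A toy illustration (the numbers are invented, only the shape of the curve matters):

```python
# Toy illustration of the multiplication-factor intuition behind criticality,
# carried over to self-improvement. k is a hypothetical "improvement multiplier"
# per generation of the loop.
def run_generations(k, capability=1.0, generations=20):
    history = [capability]
    for _ in range(generations):
        capability *= k          # each generation rebuilds the next
        history.append(capability)
    return history

print(run_generations(0.8)[-1])   # k < 1: gains shrink toward zero (sub-critical fizzle)
print(run_generations(1.3)[-1])   # k > 1: gains compound into a runaway
```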
But yes, at some future point in time it WILL be strong enough, and crazy shit will happen. I mean, think about the nuclear example: all those decades of discovering nuclear physics, fission, the chain reaction, building a nuclear reactor, purifying the plutonium... all that time, and the interesting event happened in milliseconds.