Roko comments on “AI Alignment” is a Dangerously Overloaded Term

Roko 16 Dec 2023 1:00 UTC
2 points
0

And once there is a STEM+ AI (which doesn’t need to itself be superintelligent, no more than humans are), superintelligence is at most a year away,

Why? Where does this number come from?
- Vladimir_Nesov 16 Dec 2023 1:27 UTC
  2 points
  0
  Parent
  A long training run, decades of human-speed algorithmic progress as initial algorithmic progress enables faster inference and online learning. I expect decades of algorithmic progress are sufficient to fit construction of superintelligence into 1e29 FLOPs with idiosyncratic interconnect. It’s approximately the same bet as superintelligence by the year 2100, just compressed within a year (as an OOM estimate) due to higher AI serial speed.
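  One rough way to read the "compressed within a year" estimate: if the progress expected at human speed between 2023 and 2100 has to fit into a single year, the implied serial speedup (written here as $s$, a symbol introduced only for this illustration) is

  ```latex
  s \approx \frac{(2100 - 2023)\ \text{years}}{1\ \text{year}} = 77 \approx 10^{1.9}
  ```

  That is roughly two orders of magnitude, consistent with treating the figure as an OOM estimate rather than a precise forecast.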
  - Roko 16 Dec 2023 3:42 UTC
    2 points
    0
    Parent
    But the returns to that algorithmic progress diminish as we move up. It is harder to improve something that is already good than to take something really bad and apply the first big insight.
    
    How much benefit does AlphaZero have over Deep Blue with equal computational resources, as measured in Elo and in material?
  - Gerald Monroe 16 Dec 2023 1:36 UTC
    2 points
    0
    Parent
    You don’t think you would need to evaluate a large number of “ASI candidates” to find an architecture that scales to superintelligence? What I mean is that you can describe every architectural choice you make as a single string, or “search space coordinate”. You would use a smaller model and proxy tasks, but you would still need to train and evaluate each smaller model.
    
    All these failures might eat a lot of compute. How many failures do you think we would have? What if it was 10,000 failures and we needed to reach GPT-4 scale to evaluate each one?
    
    Also, would “idiosyncratic interconnect” limit which tasks the model is superintelligent at? It would seem to imply a limit on how much information can be considered in one context. This might leave the model less than superintelligent at very complex, coupled tasks like “keep this human patient alive”, while less coupled tasks like “design this IC from scratch” would work. (The chip design task is less coupled because you can subdivide it into modules separated by interfaces and use a separate ASI session for each module’s design.)
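
The compute question raised in this thread (10,000 failed candidates evaluated at GPT-4 scale, against a 1e29 FLOP budget) can be made concrete with a back-of-envelope sketch. The 1e29 budget and the 10,000-failure count come from the comments above; the per-run cost of 2e25 FLOPs is an assumption, a commonly cited outside estimate for GPT-4-scale training and not a confirmed figure. The function names are hypothetical and exist only for this illustration.

```python
# Back-of-envelope check: does an architecture search with many failed
# candidate runs fit inside a fixed compute budget?

BUDGET_FLOPS = 10**29          # budget figure quoted in the thread
FAILED_RUNS = 10_000           # hypothetical failure count from the thread

# ASSUMPTION: GPT-4 training compute has not been published. 2e25 FLOPs is a
# commonly cited outside estimate, used here only as a placeholder.
GPT4_SCALE_FLOPS = 2 * 10**25


def search_cost(runs: int, flops_per_run: int) -> int:
    """Total compute spent if every candidate run costs flops_per_run."""
    return runs * flops_per_run


def max_affordable_runs(budget: int, flops_per_run: int) -> int:
    """How many full candidate runs fit inside the budget."""
    return budget // flops_per_run


cost = search_cost(FAILED_RUNS, GPT4_SCALE_FLOPS)
print(f"search cost:        {cost:.1e} FLOPs")
print(f"multiple of budget: {cost / BUDGET_FLOPS:.1f}x")
print(f"affordable runs:    {max_affordable_runs(BUDGET_FLOPS, GPT4_SCALE_FLOPS)}")
```

Under the assumed per-run cost, 10,000 failures at that scale would use about twice the whole budget, so the search would only fit if proxy evaluations were substantially cheaper than GPT-4 scale. Changing `GPT4_SCALE_FLOPS` or `FAILED_RUNS` shows how sensitive that conclusion is to either number.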