And that for every X except x0, it is mysteriously impossible to build any computational system which generates a range of actions, predicts the consequences of those actions relative to some ontology and world-model, and then selects among probable consequences using criterion X.
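(For concreteness, a minimal sketch of the architecture the quote describes: generate candidate actions, predict their consequences with a world-model, and select using a criterion X. The names and structure below are purely illustrative assumptions, not anyone's proposed design.)

```python
# Illustrative sketch only: a generate / predict / select loop in which the
# selection criterion X is an interchangeable parameter. All names here are
# invented for illustration.

def choose_action(generate_actions, predict_consequences, criterion_x, state):
    """Return the action whose predicted outcome scores highest under criterion_x."""
    best_action, best_score = None, float("-inf")
    for action in generate_actions(state):             # generate a range of actions
        outcome = predict_consequences(state, action)   # world-model prediction
        score = criterion_x(outcome)                    # evaluate under criterion X
        if score > best_score:
            best_action, best_score = action, score
    return best_action
```

Nothing in the loop depends on what `criterion_x` is, which is what makes the quoted position (that it works for x0 but for no other X) look mysterious.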
It sounds implausible when you put it like that, but suppose the only practical way to build a superintelligence is through some method that severely constrains the possible goals it might have (e.g., evolutionary methods, or uploading the smartest humans around and letting them self-modify), and attempts to build general purpose AIs/oracles/planning tools get nowhere (i.e., fail to be competitive against humans) until one is already a superintelligence.
Maybe when Bostrom/Armstrong/Yudkowsky talk about “possibility” in connection with the orthogonality thesis, they’re talking purely about theoretical possibility as opposed to practical feasibility. In fact Bostrom made this disclaimer in a footnote:
The orthogonality thesis implies that most any combination of final goal and intelligence level is logically possible; it does not imply that it would be practically easy to endow a superintelligent agent with some arbitrary or human-respecting final goal—even if we knew how to construct the intelligence part.
But then who are they arguing against? Are there any AI researchers who think that even given unlimited computing power and intelligence on the part of the AI builder, it’s still impossible to create AIs with arbitrary (or diverse) goals? This isn’t Pei Wang’s position, for example.
There are multiple variations on the OT, and the kinds that merely say it is possible can't support the UFAI argument. The UFAI argument is conjunctive, and each stage in the conjunction needs to have a non-negligible probability, or else it is a Pascal's Mugging.
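(A worked illustration of the conjunctive point, with made-up numbers: if the argument has five independent stages each judged 90% probable, the conjunction comes out around 0.9^5 ≈ 0.59; but if even one stage is judged only 0.1% probable, the whole conclusion inherits a probability of at most 0.001, and acting on it starts to resemble paying the mugger.)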
build any computational system which generates a range of actions, predicts the consequences of those actions relative to some ontology and world-model, and then selects among probable consequences using criterion X.
Nothing mysterious here: this naive approach has incredibly low payoff per computation, and even if you start with such a system and get it to be smart enough to make improvements, the first thing it will improve is its own architecture.
If I gave you 10^40 flops, which could probably support a 'superintelligent' mind, your naive approach would still be dumber than a housecat on many tasks. For some world evolutions and utility functions, you can invert 'simulate and choose' much better (think towering exponents times better) than the brute-force 'try different actions' approach. In general you can't. Some functions are easier to invert than others. A lot easier.
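(A toy illustration of that last point, with hypothetical numbers rather than the commenter's: when a function has known structure, finding the input that yields a desired output can be done directly, while a black-box version forces a brute-force search.)

```python
# Toy example (hypothetical, purely for illustration): inverting a function with
# known structure versus searching over its inputs as if it were a black box.

def f(x):
    return 3 * x + 7                                      # known affine structure

def invert_analytically(y):
    return (y - 7) // 3                                   # direct inverse: one step

def invert_by_search(y, candidates):
    return min(candidates, key=lambda x: abs(f(x) - y))   # brute-force scan

target = 1_000_003
print(invert_analytically(target))                 # 333332, computed instantly
print(invert_by_search(target, range(10**6)))      # 333332, after ~10^6 evaluations
```

The gap in this toy case is only linear versus constant; for planning problems, the corresponding gap between exploiting structure and enumerating actions can be astronomically larger, which is the commenter's point.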
I don’t think I’ve seen that particular reversal of the position before. Neat.
Yep. I’m calling that the “no Oracle, no general planning” position in my paper.