David Althaus comments on Q&A with Michael Littman on risks from AI

David Althaus 15 Jan 2012 22:04 UTC
0 points

I can only voice some doubts and questions.

Just wanted to say that I think it’s great that you voice questions and doubts. Most folks who don’t agree with the “party-line” on LW, or substantial amounts thereof, probably just leave.

I do not have the necessary education to evaluate state-of-the-art AI research and to grasp the associated fields that are required to make predictions about the nature of possible AIs capable of self-modification.

I don’t have the necessary education either. But you can always make predictions, even if you know almost nothing about the topic in question. You just have to widen your confidence intervals! :-)
- XiXiDu 16 Jan 2012 9:42 UTC
  0 points
  Parent
  
  Most folks who don’t agree with the “party-line” on LW, or substantial amounts thereof, probably just leave.
  
  Yes, I was talking to people on Facebook who just “left”.
  
  But you can always make predictions, even if you know almost nothing about the topic in question.
  
  The problem is that I find most of the predictions being made convincing, but only superficially so. There seem to be a lot of hidden assumptions.
  
  If you were going to speed up a chimp brain a million times, would it quickly reach human-level intelligence? I don’t think so. Why would it be different for a human-level intelligence trying to reach transhuman intelligence? It seems like a nice idea when formulated in English, but would it work?
  
  Understanding Chess_intelligence does not mean that we understand Human_intelligence. As I see it, either there is a single theory of general intelligence and improving it is just a matter of throwing more resources at it, or different levels are fundamentally different and you can’t just interpolate Go_intelligence from Chess_intelligence...
  
  Suppose there is one complete theory of general intelligence and that, once it is discovered, one just has to throw more resources at it. It might be able to incorporate all human knowledge, adapt it and find new patterns. But would it really be vastly superior to human society and its expert systems?
  
  Take for example a Babylonian mathematician. If you traveled back in time and were to accelerate his thinking a million times, would he discover place-value notation to encode numbers in a few days? I doubt it. Even if he was to digest all the knowledge of his time in a few minutes, I just don’t see him coming up with quantum physics after a short period of time.
  
  That conceptual revolutions are just a matter of computational resources seems like pure speculation. If one were to speed up the whole Babylonian world and accelerate cultural evolution, obviously one would arrive at some insights more quickly. But how much more quickly? How much do many insights depend on experiments that yield empirical evidence and can’t be sped up considerably? And what is the return? Is the payoff proportional to the resources that are necessary?
  
  Another problem is whether one can improve intelligence itself, apart from solving well-defined problems and making more accurate predictions on well-defined classes of problems. I don’t think the discovery of unknown unknowns is subject to heuristics other than natural selection. Without goals, well-defined goals, terms like “optimization” have no meaning.
  
  Without well-defined goals in the form of a precise utility function, I don’t think it would be possible to maximize expected “utility”. Concepts like “efficient”, “economic” or “self-protection” all have a meaning that is inseparable from an agent’s terminal goals. If you just tell it to maximize paperclips then this can be realized in an infinite number of ways that would all be rational given imprecise design and goal parameters. Undergoing explosive recursive self-improvement, taking over the universe and filling it with paperclips is just one outcome. Why would an arbitrary mind pulled from mind-design space care to do that? Why not just wait for paperclips to arise due to random fluctuations out of a state of chaos? That wouldn’t be irrational. To have an AI take over the universe as fast as possible you would have to explicitly design it to do so.
  - Emile 16 Jan 2012 10:09 UTC
    3 points
    Parent
    I don’t think that the LW “party line” is that mere additional computational resources are sufficient to get superintelligence or even just intelligence (I’d find such a view simplistic and a bit naive, but I don’t find Eliezer’s views simplistic and naive).
    
    I think that it’s pretty likely that today’s hardware would be in theory sufficient to run roughly human-level or superhuman intelligence (in the broad sense, “could do most intellectual jobs humans do today” for example), though that doesn’t mean humans are likely to make them anytime soon (just like, if you teleported a competent engineer back to ancient Greece, he would be able to make some amazing device with the technology of the time, even if that doesn’t mean the Greeks were about to invent those things).
    
    I do think that as computational resources increase, the number of ways of designing minds increases, so it makes it more and more likely that someone will eventually figure out how to make something AGIish. But that’s not the same as saying that “just increase computational resources and it’ll work!”.
    
    For an analogy, it may have been possible to build an internal combustion engine with 1700-era technology, but as time went by and the precision of measurement and manufacturing tools increased, it became easier and easier to do. That doesn’t mean that giving 1700-era manufacturing machinery modern levels of precision would have been enough to let people build an internal combustion engine.
    - XiXiDu 16 Jan 2012 11:43 UTC
      1 point
      Parent
      
      ...but I don’t find Eliezer’s views simplistic and naive...
      
      My whole problem is that some people seem to have high confidence in the following idea voiced by Eliezer:
      
      I think that at some point in the development of Artificial Intelligence, we are likely to see a fast, local increase in capability—“AI go FOOM”. Just to be clear on the claim, “fast” means on a timescale of weeks or hours rather than years or decades; and “FOOM” means way the hell smarter than anything else around, capable of delivering in short time periods technological advancements that would take humans decades, probably including full-scale molecular nanotechnology (that it gets by e.g. ordering custom proteins over the Internet with 72-hour turnaround time).
      
      I do not doubt that it is a possibility, but I just don’t see how people justify being very confident about it. It sure sounds nice when formulated in English. But is it the result of disjunctive reasoning? I perceive it to be conjunctive: a lot of assumptions have to turn out to be correct for humans to discover, overnight, simple algorithms that can then be improved to self-improve explosively. I would compare that to the idea of a Babylonian mathematician discovering modern science and physics after being uploaded into a supercomputer. I believe that to be highly speculative. It assumes that he could brute-force conceptual revolutions. Even if he were given a detailed explanation of how his mind works and the resources to understand it, self-improving to achieve superhuman intelligence assumes that throwing resources at the problem of intelligence will magically allow him to pull improved algorithms from solution space as if they were signposted. But unknown unknowns are not signposted. It’s rather like finding a needle in a haystack. Evolution is great at doing that, and assuming that one could speed up evolution considerably is another assumption about technological feasibility and real-world resources.
      - TheOtherDave 16 Jan 2012 15:50 UTC
        5 points
        Parent
        OK, so here are some assumptions, stated as disjunctively as I can:
        
        1: Humans have, over the last hundred years, created systems in the world that are intended to achieve certain goals. Call those systems “technology” for convenience.
        
        2: At least some technology is significantly more capable of achieving the goals it’s intended to achieve than its closest biologically evolved analogs. For example, technological freight-movers can move more freight further and faster than biologically evolved ones.
        
        3: For the technology described in assumption 2, biological evolution would have required millennia to develop equivalently capable systems for achieving the goals of that technology.
        
        4: Human intelligence (rather than other things such as, for example, human musculature or covert intervention by technologically advanced aliens) is primarily responsible for the creation of technology described in assumption 2.
        
        5: Technology analogous to the technology-developing functions of human intelligence is in principle possible.
        
        6: Technological technology-developers, if developed, will be significantly more capable of developing technology than human intelligence is.
        
        Here are some assertions of confidence in these assumptions:
        
        A1: 1-epsilon.
        A2: 1-epsilon.
        A3, given A2: .99+
        A4, given A2: ~.9
        A5, given A4: .99+
        A6, given A5: .95+
        
        I conclude a .8+ confidence that it’s in principle possible for humans to develop systems that are significantly more capable of delivering technological developments than humans are.
        
        I’ll pause there and see if we’ve diverged thus far: if you have different confidence levels for the assumptions I’ve stated, I’m interested in yours. If you don’t believe that my conclusion follows from the assumptions I’ve stated, I’m interested in why not.
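The ".8+" figure in TheOtherDave's comment above can be checked with a few lines of arithmetic. The sketch below is only one possible reading: it assumes the stated confidences are meant to be multiplied as a single chain, approximates "1-epsilon" as 1.0, and takes each "+" figure at its lower bound. None of these choices is spelled out in the comment itself.

```python
# Stated confidences from the comment; "1-epsilon" is approximated as 1.0
# and each ".99+"-style figure is taken at its lower bound (assumptions).
confidences = {
    "A1": 1.0,
    "A2": 1.0,
    "A3 given A2": 0.99,
    "A4 given A2": 0.90,
    "A5 given A4": 0.99,
    "A6 given A5": 0.95,
}


def joint_lower_bound(values):
    """Multiply the confidences together, treating them as one chain."""
    product = 1.0
    for value in values:
        product *= value
    return product


joint = joint_lower_bound(confidences.values())
print(f"joint confidence: {joint:.3f}")
```

Under these assumptions the product comes out a little under 0.84, which is consistent with the ".8+" stated in the comment. Lowering any single estimate lowers the product in proportion, which is one way to see where a disagreement about A5 or A6 would bite.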
        XiXiDu 16 Jan 2012 18:41 UTC
        3 points
        Parent
        You can’t really compare technological designs for which there was no selection pressure, and therefore no optimization, with superficially similar evolutionary inventions. For example, you would have to compare the energy efficiency with which insects or birds can carry certain amounts of weight with a similar artificial means of transport carrying the same amount of weight. Or you would have to compare the energy efficiency and maneuverability of bird and insect flight with artificial flight. But comparing a train full of hard disk drives with the bandwidth of satellite communication is not useful. Saying that a rocket can fly faster than anything that evolution came up with is not generalizable to intelligence. And even if I were to accept that argument, there are many counterexamples: the echolocation of bats, economic photosynthesis, or human gait. And the invention of rockets did not lead to space colonization either; space exploration is actually retrogressive.
        
        You also mention that human intelligence is primarily responsible for the creation of technology. I do think this is misleading. What is responsible is that we are goal-oriented while evolution is not. But the advance of scientific knowledge is largely an evolutionary process. I don’t see that intelligence is currently tangible enough to measure whether the return of increased intelligence is proportional to the resources it would take to amplify it. The argument from the gap between chimpanzees and humans is interesting but cannot be used to extrapolate onwards from human general intelligence. It is pure speculation that humans are not Turing complete and that there are levels above our own. That chimpanzees exist, and humans exist, is not proof of the existence of anything that bears, in any relevant respect, the same relationship to a human that a human bears to a chimpanzee.
        
        It is in principle possible to create artificial intelligence that is as capable as human intelligence. But this says nothing about how quickly we will be able to come up with it. I believe that intelligence is fundamentally dependent on the complexity of the goals against which it is measured. Goals give rise to agency and define an agent’s drives. As long as we are unable to precisely hard-code a complexity of values similar to that of humans, we won’t achieve levels of general intelligence similar to humans.
        
        It is true that humans have created a lot of tools that help them to achieve their goals. But it is not clear that incorporating those tools into some sort of self-perception, some sort of guiding agency, is superior to humans using a combination of tools and expert systems. In other words, it is not clear that there exists a class of problems that is solvable by Turing machines in general, but not by a combination of humans and expert systems. And if that were the case then I think that, just like chimpanzees would be unable to invent science, we won’t be able to come up with a meta-heuristic that would allow us to discover algorithms that can solve a class of problems that we can’t (other than by using guided evolution).
        
        Besides, recursive self-improvement does not demand sentience, consciousness or agency. Even if humans are not able to “recursively improve” their own algorithms we can still “recursively improve” our tools. And the supremacy of recursively improving agents over humans and their tools is a reasonable conjecture but not a fact. It largely relies on the idea that the integration of tools into a coherent framework of agencies has huge benefits.
        
        I also object to assigning numerical probability estimates to informal arguments and predictions. When faced with data from empirical experiments, or goats behind doors in a gameshow, it is reasonable. But using formalized methods to evaluate informal evidence can be very misleading. For real-world, computationally limited agents it is a recipe to fail spectacularly. Using formalized methods to evaluate vague ideas like risks from AI can lead you to dramatically over- or underestimate evidence by forcing you to use your intuition to assign numbers to your intuitive judgement of informal arguments.
        
        And as a disclaimer: Don’t jump to the conclusion that I generally rule out the possibility that very soon someone will stumble upon a simple algorithm that can be run on a digital computer, that can be improved to self-improve, become superhuman and take over the universe. All I am saying is that the possibility isn’t as inevitable as some seem to believe. If forced, I would probably assign a 1% probability to it but still feel uncomfortable about that (which is not to be equated with risks from AI in general; I don’t think FOOM is required for AIs to pose a risk).
        
        I think that Eliezer crossed the border of what can sensibly be said about this topic at the present time when he says that AI will likely invent molecular nanotechnology in a matter of hours or days. Jürgen Schmidhuber is the only person I could find who might agree with that. Even Shane Legg is more skeptical. And since I do not yet have the education to evaluate state of the art AI research myself I will side with the experts and say that Eliezer is likely wrong. Of course, I have no authority but I have to make a decision. I don’t feel it would be reasonable to believe Eliezer here without restrictions.
        
        Just because the possibility of superhuman AI seems to be disjunctive on some level doesn’t mean that there are no untested assumptions underlying the claims that such an outcome is possible. Reduce the vagueness and you will discover a set of assumptions that need to be true in conjunction.
        TheOtherDave 16 Jan 2012 20:29 UTC
        4 points
        Parent
        So, I’m having a lot of difficulty mapping your response to the question I asked. But if I’ve understood your response, you are arguing that technology analogous to the technology-developing functions of human intelligence might not be in principle possible, or that if developed might not be capable of significantly greater technology-developing power than human intelligence is.
        
        In other words, that assumptions 5 and/or 6 might be false.
        
        I agree that it’s possible. Similar things are true of the other examples you give: it’s possible that technological echolocation, or technological walking, or technological photosynthesis, either aren’t possible in principle, or can’t be significantly more powerful than their naturally evolved analogs. (Do you actually believe that to be true of those examples, incidentally?)
        
        This seems to me highly implausible, which is why my confidence in A5 and A6 is very high. (I have similarly high confidence in our ability to develop machines more efficient than human legs at locomotion, machines more efficient at converting sunlight to useful work than plants, and more efficient at providing sonar-based information about their surroundings than bats.)
        
        So, OK. We’ve identified a couple of specific, relevant assertions for which you think that my confidence is too high. Awesome! That’s progress.
        
        So, what level of confidence do you think is justified for those assertions? I realize that you reject assigning numbers to reported confidence, so OK… do you have a preferred way of comparing levels of confidence? Or do you reject the whole enterprise of such comparisons?
        
        Incidentally: you say a lot of other stuff here which seems entirely beside my point… I think because you’re running out ahead to arguments you think I might make some day. I will return to that stuff if I ever actually make an argument to which it’s relevant.
        asr 16 Jan 2012 18:47 UTC
        0 points
        Parent
        I am uneasy with premise 4. I think human technological progress involves an awful lot of tinkering and evolution, and intelligent action by the technologist is not the hardest part. I doubt that if we could all think twice as quickly*, we would develop technology twice as quickly. The real rate-limiting step isn’t the design, it’s building things and testing them.
        
        This doesn’t mean that premise 4 is wrong, exactly, but it means that I’m worried it’s going to be used in an inconsistent, equivocal way.
        
        *I am picturing taking all the relevant people, and having them think the same thoughts they do today, in half the time. Presumably they use the newly-free time to think more thoughts.
        TheOtherDave 16 Jan 2012 19:27 UTC
        1 point
        Parent
        Fair enough. If I end up using it equivocally or inconsistently, please do call me out on it.
        
        Note that absolutely nothing I’ve said so far implies people thinking the same thoughts they do today in half the time.
        asr 16 Jan 2012 19:53 UTC
        0 points
        Parent
        No no, I wasn’t attributing “same thoughts in half the time” to you. I was explaining the thought-experiment I was using to distinguish “intelligence” as an input from other requirements for technology creation.
        TheOtherDave 16 Jan 2012 21:09 UTC
        1 point
        Parent
        If what you understand by “intelligence” is the ability to arrive at the same conclusions faster, then I agree with you that that thing has almost nothing to do with technological development, and I should probably back up and rewrite assumptions 4-6 while tabooing the word “intelligence”.
  - David Althaus 17 Jan 2012 12:39 UTC
    2 points
    Parent
    
    If you just tell it to maximize paperclips then this can be realized in an infinite number of ways
    
    If the AI has the goal of maximizing the number of paperclips in the universe and it is a rational utility maximizer, it will try to find the most efficient way to do that, and there is probably only one (i.e. recursive self-improvement, acquiring resources, etc.). You’re right, if the AI isn’t a rational utility maximizer it could do anything.
    - XiXiDu 17 Jan 2012 13:24 UTC
      −2 points
      Parent
      
      You’re right, if the AI isn’t a rational utility maximizer it could do anything.
      
      I don’t think this follows. Even a rational utility maximizer can maximize paperclips in a lot of different ways. How it does it is fundamentally dependent on its utility function and how precisely it was defined. If there are no constraints in the form of design and goal parameters then it can maximize paperclips in all sorts of ways that don’t demand recursive self-improvement. “Utility” only becomes well-defined once we precisely define what it means to maximize it. Just maximizing paperclips doesn’t define how quickly and how economically it is supposed to happen.
      - David Althaus 17 Jan 2012 15:04 UTC
        2 points
        Parent
        I don’t understand your arguments.
        
        My intuition is this: If the AI has the goal “The more paperclips the better: e.g. a universe containing 1002 paperclips is “400 utilons” better than a universe containing 602” then it will try to maximize paperclips. And if it tries this by reciting poems from the Bible then it isn’t a rational AI, since it does not employ the most efficient strategy for maximizing paperclips.
        
        The very definition of “rational utility maximizer” implies that it will try to maximize utilons as fast and as efficiently as possible. Sure, it’s possible that recursive self-improvement isn’t a good strategy for doing so, but I think it’s not unlikely. Am I missing something?
        
        If the AI has a different utility function like “paperclips are pretty cool, but not as awesome as other things” then it will do other things.
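The "400 utilons" arithmetic in David Althaus's comment above corresponds to a utility function that is linear in the number of paperclips. The following is a minimal sketch of that reading; the strategy names and their paperclip yields are made up purely for illustration and do not come from the thread.

```python
def utility(paperclips: int) -> int:
    """One utilon per paperclip, matching the 1002-vs-602 example."""
    return paperclips


# Hypothetical strategies with made-up paperclip yields (illustration only).
expected_paperclips = {
    "recite_poems": 0,
    "run_one_factory": 602,
    "self_improve_then_build": 1002,
}


def best_strategy(outcomes):
    """Pick the strategy whose outcome has the highest utility."""
    return max(outcomes, key=lambda name: utility(outcomes[name]))


gap = utility(1002) - utility(602)          # difference in utilons
choice = best_strategy(expected_paperclips)  # highest-utility strategy
print(gap, choice)
```

Note that this utility function has no term for time or resource cost, so on its own it ranks outcomes only by final paperclip count. Whether "as fast as possible" follows depends on how the utility function is specified, which is the point under dispute in this sub-thread.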
        wedrifid 17 Jan 2012 16:39 UTC
        1 point
        Parent
        
        The very definition of “rational utility maximizer” implies that it will try to maximize utilons as fast and as efficiently as possible. Sure, it’s possible that recursive self-improvement isn’t a good strategy for doing so, but I think it’s not unlikely. Am I missing something?
        
        No, you are not missing anything, at least not here. XiXiDu simply doesn’t have a firm grasp on the concept of optimization. Don’t let this confuse you.
        XiXiDu 17 Jan 2012 16:24 UTC
        −2 points
        Parent
        
        The very definition of “rational utility maximizer” implies that it will try to maximize utilons as fast and as efficiently as possible.
        
        The problem is that “utility” has to be defined. To maximize expected utility does not imply certain actions, efficiency and economic behavior, or the drive to protect yourself. You can also rationally maximize paperclips without protecting yourself if it is not part of your goal parameters.
        
        I know what kind of agent you assume. I am just pointing out what needs to be true in conjunction to make the overall premise true. Expected utility maximizing does not equal what you assume. You can also assign utility to maximizing paperclips for as long as nothing turns you off, without caring about being turned off. If an AI is not explicitly programmed to care about it, then it won’t.
  - faul_sname 16 Jan 2012 9:51 UTC
    1 point
    Parent
    
    Take for example a Babylonian mathematician. If you traveled back in time and were to accelerate his thinking a million times, would he discover place-value notation to encode numbers in a few days? I doubt it. Even if he was to digest all the knowledge of his time in a few minutes, I just don’t see him coming up with quantum physics after a short period of time.
    
    I suspect that he would invent place-value notation or something similar within a few “days.” Remember that a “day” at 1 million times speedup is over 2 millennia. Now, there are clearly some difficulties in testing what an isolated human produces in roughly 2,700 years, but we could look at cases of hermits who left civilization for periods of years or decades. Did any of them come up with revolutionary ideas? If the answer is yes in a decade or two, it is almost certain that a human at 1,000,000x speedup would have several such insights (assuming that such a speedup doesn’t result in insanity). I can’t imagine the mathematician coming up with quantum mechanics, but that could easily be a failure of my 1x speed brain.
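The speedup arithmetic in faul_sname's comment can be made explicit. This is a small sketch; the 365.25-day year is an assumption chosen for convenience, and the function names are illustrative.

```python
SPEEDUP = 1_000_000
DAYS_PER_YEAR = 365.25       # Julian year, chosen here as an assumption
SECONDS_PER_DAY = 86_400


def subjective_years(wall_clock_days: float, speedup: float = SPEEDUP) -> float:
    """Subjective years experienced during the given wall-clock days."""
    return wall_clock_days * speedup / DAYS_PER_YEAR


def wall_clock_seconds(years: float, speedup: float = SPEEDUP) -> float:
    """Wall-clock seconds needed to experience the given subjective years."""
    return years * DAYS_PER_YEAR * SECONDS_PER_DAY / speedup


years_per_day = subjective_years(1)
hermit_decade = wall_clock_seconds(10)
print(f"{years_per_day:.0f} subjective years per wall-clock day")
print(f"{hermit_decade:.0f} wall-clock seconds per subjective decade")
```

With these numbers, one wall-clock day corresponds to a bit over 2,700 subjective years, and a hermit's decade of isolation passes in a little over five wall-clock minutes.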