That’s actually a good question. Let me rephrase it to something hopefully clearer:
Compartmentalization is an essential safety mechanism in the human mind; it prevents erroneous far mode beliefs (which we all adopt from time to time) from having disastrous consequences. A man believes he’ll go to heaven when he dies. Suicide is prohibited as a patch for the obvious problem, but there’s no requirement to make an all-out proactive effort to stay alive. Yet when he gets pneumonia, he gets a prescription for penicillin. Compartmentalization literally saves his life. In some cases it saves many other lives as well, as we saw when it failed on 9/11.
Here we have a case study where a man of intelligence and goodwill redirected his entire life down a path of negative utility on the basis of reading a single paragraph of sloppy wishful thinking backed up by no evidence whatsoever. (The most straightforward refutation of that paragraph is that creating a machine with even a noteworthy fraction of human intelligence is far beyond the capacity of any human mind; the relevant comparison of such a machine if built would be with that which created it, which would have to be a symbiosis of humanity and its technology as a whole—with that symbiosis necessarily being much more advanced than anything we have today.) What went wrong?
The most obvious part of the answer is that this is an error to which we geeks are particularly prone. (Supporting data: terrorists are disproportionately likely to be trained in some branch of engineering.) Why? Well, we are used to dealing in domains where we can actually apply long chains of logic with success; particularly in the age range when we are old enough to have forgotten how fallible our first attempts at such logic were, yet young enough to still be optimists, it’s an obvious trap to fall into.
Yet most geeks do actually manage to stay out of the trap. What else goes wrong?
It seems to me that there must be a parameter in the human mind for grasping the inertia of the world, for understanding at a gut level how much easier is concept than reality, that we can think in five minutes of ideas that the labor of a million people for a thousand years cannot realize. I suppose in some individuals this parameter must be turned up too high, and they fall too easily into the trap of learned helplessness. And in some it must be turned too low, and those of us for whom this is the case undertake wild projects with little chance of success; and if ninety-nine fail for every one who succeeds, that can yet drive the ratchet of progress.
But we easily forget that progress is not really a ratchet, and the more advanced our communications, the more lethal bad ideas become: just as our transport networks spread diseases like the 1918 flu, which killed more people in a single year than the First World War killed in four, so our communication networks spread parasite memes deadlier still. And we can’t shut down the networks. We need them too badly.
I’ve seen the Singularity mutate from a harmless, even inspiring fantasy, to a parasite meme that I suspect could well snuff out the entire future of intelligent life. It’s proving itself in many cases immune to any weight of evidence against it; perhaps worst of all, it bypasses ethical defenses, for it can be spread by people of honest goodwill.
Compartmentalization seems to be the primary remaining defense. When that fails, what have we left? This is not a rhetorical question; it may be one of the most important in the world right now.
Compartmentalization may make ridiculous far beliefs have less of an impact on the world, but it also allows those beliefs to exist in the first place. If your beliefs about religion depended on the same sort of evidence that underpins your beliefs about whether your car is running, then you could no more be convinced of religion than you could be convinced by a mechanic that your car “works” even though it does not start.
So your suggestion is that we should de-compartmentalize, but in the reverse direction to that suggested by the OP, i.e. instead of propagating forward from ridiculous far beliefs, become better at back-propagating and deleting same? There is certainly merit in that suggestion if it can be accomplished. Any thoughts on how?
You don’t understand. Decompartmentalization doesn’t have a direction. You don’t go forwards towards a belief or backwards from a belief, or whatever. If your beliefs are decompartmentalized that means that the things you believe will impact your other beliefs reliably. This means that you don’t get to CHOOSE what you believe. If you think the singularity is all important and worth working for, it’s BECAUSE all of your beliefs align that way, not because you’ve forced your mind to align itself with that belief after having it.
I understand perfectly well how a hypothetical perfectly logical system would work (leaving aside issues of computational tractability etc.). But then, such a hypothetical perfectly logical system wouldn’t entertain such far mode beliefs in the first place. What I’m discussing is the human mind, and the failure modes it actually exhibits.
What is that evidence against singularity which you’re alluding to?
I discuss some of it at length here: http://lesswrong.com/lw/312/the_curve_of_capability/
I’ll also ask the converse question: given that you can’t typically prove a negative (I can’t prove the nonexistence of psychic powers or flying saucers either), if what we are observing doesn’t constitute evidence against the Singularity in your opinion, then what would?
if what we are observing doesn’t constitute evidence against the Singularity in your opinion, then what would?
I’m not marchdown, but:
Estimating the probability of a Singularity requires looking at various possible advantages of digital minds and asking what would constitute evidence against such advantages being possible. Some possibilities:
Superior processing power. Evidence against would be the human brain already being close to the physical limits of what is possible.
Superior serial power: Evidence against would be an inability to increase the serial power of computers anymore.
Superior parallel power: Evidence against would be an indication of extra parallel power not being useful for a mind that already has human-equivalent (whatever that means) parallel power.
Improved algorithms: Evidence against would be the human brain’s algorithms already being perfectly optimized and with no further room for improvement.
Designing new mental modules: Evidence against would be evidence that the human brain’s existing mental modules are already sufficient for any cognitive task with any real-world relevance.
Modifiable motivation systems: Evidence against would be evidence that humans are already optimal at motivating themselves to work on important tasks, that realistic techniques could be developed to make humans optimal in this sense, or that having a great number of minds without any akrasia issues would have no major advantage over humans.
Copyability: Evidence against would be evidence that minds cannot be effectively copied, maybe because there won’t be enough computing power to run many copies. Alternatively, that copying minds would result in rapidly declining marginal returns and that the various copying advantages discussed by e.g. Hanson and Shulman aren’t as big as they seem.
Perfect co-operation: Evidence against would be that no minds can co-operate better than humans do, or at least not to such an extent that they’d receive a major advantage. Also, evidence of realistic techniques bringing humans to this level of co-operation.
Superior communication: Evidence against would be that no minds can communicate better than humans do, or at least not to such an extent that they’d receive a major advantage. Also, evidence of realistic techniques bringing humans to this level of communication.
Transfer of skills: Evidence against would be that no minds can teach better than humans do, or at least not to such an extent that they’d receive a major advantage. Also, evidence of realistic techniques bringing humans to this level of skill transfer.
Various biases: Evidence against would either be that human cognitive biases are not actually major ones, or that no mind architecture could overcome them. Also, evidence that humans actually have a realistic chance of overcoming most biases.
Depending on how you define “the Singularity”, some of these may be irrelevant. Personally, I think the most important aspect of the Singularity is whether minds drastically different from humans will eventually take over, and how rapid the transition could be. Excluding the possibility of a rapid takeover would require at least strong evidence against gains from increased serial power, increased parallel power, improved algorithms, new mental modules, copyability, and transfer of skills. That seems quite hard to come by, especially once you take into account the fact that it’s not enough to show that e.g. current trends in hardware development show mostly increases in parallel instead of serial power—to refute the gains from increased serial power, you’d also have to show that this is indeed some deep physical limit which cannot be overcome.
Okay, to look at some of the specifics:
Superior processing power. Evidence against would be the human brain already being close to the physical limits of what is possible.
The linked article is amusing but misleading; the described ‘ultimate laptop’ would essentially be a nuclear explosion. The relevant physical limit is ln(2)kT energy dissipated per bit erased; in SI units at room temperature this is about 3e-21 joules. We don’t know exactly how much computation the human brain performs; middle-of-the-road estimates put it in the ballpark of 1e18 several-bit operations per second for 20 watts, which is not very many orders of magnitude short of even the theoretical limit imposed by thermodynamics, let alone whatever practical limits may arise once we take into account issues like error correction, communication latency and bandwidth, and the need for reprogrammability.
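As a rough sanity check on that arithmetic (a back-of-the-envelope sketch; the 20-watt and 1e18 operations-per-second figures are the loose estimates quoted above, not measured constants):

```python
import math

k_B = 1.380649e-23              # Boltzmann constant, J/K
T = 300.0                       # approximate room temperature, K

landauer = math.log(2) * k_B * T    # minimum energy to erase one bit, ~2.9e-21 J
brain_power = 20.0                  # watts (rough estimate)
brain_ops_per_s = 1e18              # several-bit operations per second (rough estimate)
energy_per_op = brain_power / brain_ops_per_s   # ~2e-17 J per operation

print(f"Landauer limit at {T:.0f} K: {landauer:.2e} J per bit erased")
print(f"Brain, per operation: {energy_per_op:.2e} J")
print(f"Headroom factor: ~{energy_per_op / landauer:.0f}x")
# Roughly 7,000x, i.e. three to four orders of magnitude above the thermodynamic
# floor, before error correction, communication latency and bandwidth, and
# reprogrammability are taken into account.
```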
Superior serial power: Evidence against would be an inability to increase the serial power of computers anymore.
Indeed we hit this some years ago. Of course as you observe, it is impossible to prove serial speed won’t start increasing again in the future; that’s inherent in the problem of proving a negative. If such proof is required, then no sequence of observations whatsoever could possibly count as evidence against the Singularity.
Superior parallel power:
Of course uses can always be found for more parallel power. That’s why we humans make use of it all the time, both by assigning multiple humans to a task, and increasingly by placing multiple CPU cores at the disposal of individual humans.
Improved algorithms:
Finding these is (assuming P!=NP) intrinsically difficult; humans and computers can both do it, but neither will ever be able to do it easily.
Designing new mental modules:
Same as for improved algorithms.
Modifiable motivation systems:
An advantage when they reduce akrasia, a disadvantage when they make you more vulnerable to wireheading.
Copyability: Evidence against would be evidence that minds cannot be effectively copied, maybe because there won’t be enough computing power to run many copies.
Indeed there won’t, at least initially; supercomputers don’t grow on trees. Of course, computing power tends to become cheaper over time, but that does take time, so no support for hard takeoff here.
Alternatively, that copying minds would result in rapidly declining marginal returns and that the various copying advantages discussed by e.g. Hanson and Shulman aren’t as big as they seem.
Matt Mahoney argues that this will indeed happen because an irreducible fraction of the knowledge of how to do a job is specific to that job.
Perfect co-operation:
Some of the more interesting AI work has been on using a virtual market economy to allocate resources between different modules within an AI program, which suggests computers and humans will be on the same playing field.
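For concreteness, here is a minimal toy sketch of the kind of market-based allocation being alluded to; the module names and bid values are purely hypothetical, and real systems in that line of work are far more elaborate:

```python
def allocate(budget, bids):
    """Split a fixed compute budget among modules in proportion to their bids."""
    total = sum(bids.values())
    if total == 0:
        return {name: 0.0 for name in bids}
    return {name: budget * bid / total for name, bid in bids.items()}

# Hypothetical modules bidding for 100 units of compute this cycle.
# A fuller system would pay modules for delivered value, so that useful
# modules accumulate wealth and win more resources in later rounds.
bids = {"planner": 5.0, "vision": 3.0, "memory_search": 2.0}
print(allocate(budget=100.0, bids=bids))
# {'planner': 50.0, 'vision': 30.0, 'memory_search': 20.0}
```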
Superior communication:
Empirically, progress in communication technology between humans outpaces progress in AI, and has done so for as long as digital computers have existed.
Transfer of skills:
Addressed under copyability.
Various biases:
Hard to say, both because it’s very hard to see our own biases, and because a bias that’s adaptive in one situation may be maladaptive in another. But if we believe maladaptive biases run deep, such that we cannot shake them off with any confidence, then we should be all the more skeptical of our far beliefs, which are the most susceptible to bias.
Of course, there is also the fact that humans can and do tap the advantages of digital computers, both by running software on them, and in the long run potentially by uploading to digital substrate.
we should be all the more skeptical of our far beliefs, which are the most susceptible to bias.
Just out of interest… assume my far beliefs take the form of a probability distribution over possible future outcomes. How can I be “skeptical” of that? Given that something will happen in the future, all I can do is update in the direction of a different probability distribution.
In other words, which direction am I likely to be biased in?
In the direction of overconfidence, i.e., assigning too much probability mass to your highest-probability theory.
We should update away from beliefs that the future will resemble a story, particularly a story whose primary danger will be fought by superheroes (most particularly for those of us who would personally be among the superheroes!) and towards beliefs that the future will resemble the past and the primary dangers will be drearily mundane.
The future will certainly resemble a story—or, more accurately, will be capable of being placed into several plausible narrative frames, just as the past has. The bias you’re probably trying to point to is in interpreting any particular plausible story as evidence for its individual components—or, for that matter, against.
The conjunction fallacy implies that any particular vision of a Singularity-like outcome is less likely than our untrained intuitions would lead us to believe. It’s an excellent reason to be skeptical of any highly derived theories of the future—the specifics of Ray Kurzweil’s singularity timeline, for example, or Robin Hanson’s Malthusian emverse. But I don’t think it’s a good reason to be skeptical of any of the dominant singularity models in general form. Those don’t work back from a compelling image to first principles; most of them don’t even present specific consequential predictions, for fairly straightforward reasons. All the complexity is right there on the surface, and attempts to narrativize it inevitably run up against limits of imagination. (As evidence, the strong Singularity has been fairly poor at producing fiction when compared to most future histories of comparable generality; there’s no equivalent of Heinlein writing stories about nuclear-powered space colonization, although there’s quite a volume of stories about weak or partial singularities.)
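To make the conjunction point concrete, a small worked example with invented numbers (the probabilities are purely illustrative): each extra detail added to a scenario multiplies in another factor no greater than one, so a fully specified story can never be more probable than its vaguest component claim.

```python
# Illustrative component probabilities; the values are made up.
p_agi_this_century = 0.5
p_rapid_takeoff_given_agi = 0.3
p_specific_timeline_given_takeoff = 0.1

# The detailed story is the conjunction of all three claims.
p_detailed_story = (p_agi_this_century
                    * p_rapid_takeoff_given_agi
                    * p_specific_timeline_given_takeoff)

print(p_detailed_story)                          # ~0.015, far below any single component
assert p_detailed_story <= p_agi_this_century    # adding detail can only lower probability
```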
So yes, there’s not going to be a singleton AI bent on turning us all into paperclips. But that’s a deliberately absurd instantiation of a much more general pattern. I can conceive of a number of ways in which the general pattern too might be wrong, but the conjunction fallacy doesn’t fly; a number of attempted debunkings, meanwhile, do suffer from narrative fixation issues.
Superhero bias is a more interesting question—but it’s also a more specific one.
Well, any sequence of events can be placed in a narrative frame with enough of a stretch, but the fact remains that different sequences of events differ in their amenability to this; fiction is not a random sampling from the space of possible things we could imagine happening, and the Singularity is narratively far stronger than most imaginable futures, to a degree that indicates a bias we should correct for. I’ve seen a fair bit of strong Singularity fiction at this stage, though being, well, singular, it tends not to be amenable to repeated stories by the same author the way Heinlein’s vision of nuclear-powered space colonization was.
Empirically, progress in communication technology between humans outpaces progress in AI, and has done so for as long as digital computers have existed.
The best way to colonize Alpha Centauri has always been to wait for technology to improve rather than launching an expedition, but it’s impossible for that to continue to be true indefinitely. Short of direct mind-to-mind communication or something similar, coupled with a concurrent halt to AI progress, AI advances will probably outpace advances in human communication in the near to medium term.
It seems unreasonable to believe that human minds, optimized for considerations such as politicking in addition to communication, will be able to communicate just as well as designed AIs. Human mind development was constrained by ancestral energy availability, head size, and so on, so it’s unlikely that we represent the optimal size of mind for forming a group of minds, even assuming an AI isn’t able to reap huge efficiencies by operating essentially as a single mind, regardless of scale.
Or human communications may stop improving because they are good enough to no longer be a major bottleneck, in which case it may not greatly matter whether other possible minds could do better. Amdahl’s law: if something was already only ten percent of total cost, improving it by a factor of infinity would reduce total cost by only that ten percent.
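A one-line check of that Amdahl’s-law arithmetic (the ten-percent figure is the hypothetical from the sentence above, not a measurement):

```python
def overall_speedup(improved_fraction, factor):
    """Amdahl's law: speedup when only `improved_fraction` of total cost improves by `factor`."""
    return 1.0 / ((1.0 - improved_fraction) + improved_fraction / factor)

# Improving a 10% component by an infinite factor still leaves the other 90%:
print(overall_speedup(0.10, float("inf")))   # 1.111..., i.e. total cost falls by only 10%
```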
Superior processing power. Evidence against would be the human brain already being close to the physical limits of what is possible.
It is often cited how much faster expert systems are within their narrow area of expertise. But does that mean the human brain is actually slower, or just that it can’t focus all its resources on such narrow tasks? Take for example my ability to simulate some fantasy environment, off the top of my head, in front of my mind’s eye. Or the ability of humans to run real-time egocentric world-simulations to extrapolate and predict the behavior of physical systems and other agents. Our best computers don’t even come close to that.
Superior serial power: Evidence against would be an inability to increase the serial power of computers anymore.
Chip manufacturers are already earning most of their money by making their chips more energy-efficient and better at working in parallel.
Improved algorithms: Evidence against would be the human brain’s algorithms already being perfectly optimized and with no further room for improvement.
We simply don’t know how efficient the human brain’s algorithms are. You can’t just compare artificial algorithms with the human ability to accomplish tasks that were never selected for by evolution.
Designing new mental modules: Evidence against would be evidence that the human brain’s existing mental modules are already sufficient for any cognitive task with any real-world relevance.
This is an actual feature. It is not clear that you can have a general intelligence with a huge amount of plasticity that would work at all rather than messing itself up.
Modifiable motivation systems: Evidence against would be evidence that humans are already optimal at motivating themselves to work on important tasks...
This is an actual feature, see dysfunctional autism.
Copyability: Evidence against would be evidence that minds cannot be effectively copied, maybe because there won’t be enough computing power to run many copies.
You can’t really anticipate being surprised by evidence on this point, because the minds as you define them don’t even exist yet and therefore can’t be shown not to be copyable. And regarding brains, show me some neuroscientists who think that minds are effectively copyable.
Perfect co-operation: Evidence against would be that no minds can co-operate better than humans do, or at least not to such an extent that they’d receive a major advantage.
Cooperation is a delicate quality. Too much and you get frozen, too little and you can’t accomplish much. Human science is a great example of a balance between cooperation and useful rivalry. How is a collective intellect of AGIs going to preserve the right balance without mugging itself into pursuing insane expected-utility calculations?
Excluding the possibility of a rapid takeover would require at least strong evidence against gains...
Wait, are you saying that the burden of proof is with those who are skeptical of a Singularity? Are you saying that the null hypothesis is a rapid takeover? What evidence allowed you to form that hypothesis in the first place? Making up unfounded conjectures and then telling others to disprove them will lead to privileging random high-utility possibilities that sound superficially convincing, while ignoring other problems that are based on empirical evidence.
...it’s not enough to show that e.g. current trends in hardware development show mostly increases in parallel instead of serial power—to refute the gains from increased serial power, you’d also have to show that this is indeed some deep physical limit which cannot be overcome.
All that doesn’t even matter. Computational resources are mostly irrelevant when it comes to risks from AI. What you have to show is that recursive self-improvement is possible. It is a question of whether you can dramatically speed up the discovery of unknown unknowns.
How do you propose that would happen?
We’ve had various kinds of Luddism before, but this one is particularly lethal in being a form that appeals to people who had been technophiles. If it spreads enough, the best-case scenario is that the pool of people willing to work on real technological progress shrinks; the worst case is regulation that snuffs out progress entirely, and we get to sit around bickering about primate politics until whatever window of time we had runs out.
That’s awfully vague. “Whatever window of time we had”, what does that mean?
There’s one kind of “technological progress” that SIAI opposes as far as I can tell: working on AGI without an explicit focus on Friendliness. Now if you happen to think that AGI is a must-have to ensure the long-term survival of humanity, it seems to me that you’re already pretty much on board with the essential parts of SIAI’s worldview, indistinguishable from them as far as the vast majority is concerned.
Otherwise, there’s plenty of tech that is entirely orthogonal to the claims of SIAI: cheap energy, health, MNT, improving software engineering (so-called), and so on.
That’s awfully vague. “Whatever window of time we had”, what does that mean?
The current state of the world is unusually conducive to technological progress. We don’t know how long this state of affairs will last. Maybe a long time, maybe a short time. To fail to make progress as rapidly as we can is to gamble the entire future of intelligent life on it lasting a long time, without evidence that it will do so. I don’t think that’s a good gamble.
There’s one kind of “technological progress” that SIAI opposes as far as I can tell: working on AGI without an explicit focus on Friendliness.
I have seen claims to the contrary from a number of people, from Eliezer himself a number of years ago up to another reply to your comment right now. If SIAI were to officially endorse the position you just suggested, my assessment of their expected utility would significantly increase.
Well, SIAI isn’t necessarily a homogeneous bunch of people with respect to what they oppose or endorse, but did you look, for instance, at Michael Anissimov’s entries on MNT? (Focusing on that because it’s the topic of Risto’s comment and you seem to see that as a confirmation of your thesis.) You don’t get the impression that he thinks it’s a bad idea, quite the contrary: http://www.acceleratingfuture.com/michael/blog/category/nanotechnology/
Here is Eliezer on the SL4 mailing list:
If you solve the FAI problem, you probably solve the nanotech problem. If you solve the nanotech problem, you probably make the AI problem much worse. My preference for solving the AI problem as quickly as possible has nothing to do with the relative danger of AI and nanotech. It’s about the optimal ordering of AI and nanotech.
The Luddites of our times are (for instance) groups like the publishing and music industries; using that label to describe the opinions of people affiliated with SIAI just doesn’t make sense IMO.
Human-implemented molecular nanotechnology is a bad idea. I just talk about it to attract people in who think it’s important. MNT knowledge is a good filter/generator for SL3 and beyond thinkers.
MNT without friendly superintelligence would be nothing but a disaster.
It’s true that SIAI isn’t homogeneous though. For instance, Anna is much more optimistic about uploads than I am personally.
Thanks for the link, yes, that does seem to be a different opinion (and some very interesting posts).
I agree with you about the publishing and music industries. I consider current rampant abuse of intellectual property law to be a bigger threat than the Singularity meme, sufficiently so that if your comparative advantage is in politics, opposing that abuse probably has the highest expected utility of anything you could be doing.
Molecular nanotechnology, and anything else that can be weaponized to let a very small group of people effectively kill a very large group of people, is probably something SIAI-type people would like to see countered with a global sysop scenario from the moment it gets developed.