CarlShulman comments on against “AI risk”

CarlShulman 11 Apr 2012 23:46 UTC
41 points
Speaking only for myself, most of the bullets you listed are forms of AI risk by my lights, and the others don’t point to comparably large, comparably neglected areas in my view (and after significant personal efforts to research nuclear winter, biotechnology risk, nanotechnology, asteroids, supervolcanoes, geoengineering/climate risks, and non-sapient robotic weapons). Throwing in all x-risks and the kitchen sink in, regardless of magnitude, would be virtuous in a grand overview, but it doesn’t seem necessary when trying to create good source materials in a more neglected area.

bio/nano-tech disaster

Not AI risk.

I have studied bio risk (as has Michael Vassar, who has even done some work encouraging the plucking of low-hanging fruit in this area when opportunities arose), and it seems to me that it is both a smaller existential risk than AI, and nowhere near as neglected. Likewise the experts in this survey, my conversations with others expert in the field, and reading their work.

Bio existential risk seems much smaller than bio catastrophic risk (and not terribly high in absolute terms), while AI catastrophic and x-risk seem close in magnitude, and much larger than bio x-risk. Moreover, vastly greater resources go into bio risks, e.g. Bill Gates is interested and taking it up at the Gates Foundation, governments pay attention, and there are more opportunities for learning (early non-extinction bio-threats can mobilize responses to guard against later ones).

This is in part because most folk are about as easily mobilized against catastrophic as existential risks (e.g. Gates thinks that AI x-risk is larger than bio x-risk, but prefers to work on bio rather than AI because he thinks bio catastrophic risk is larger, at least in the medium-term, and more tractable). So if you are especially concerned about x-risk, you should expect bio risk to get more investment than you would put into it (given the opportunity to divert funds to address other x-risks).

Nanotech x-risk would seem to come out of mass-producing weapons that kill survivors of an all out war (which leaves neither side standing), like systems that could replicate in the wild and destroy the niche of primitive humans, really numerous robotic weapons that would hunt down survivors over time, and such like. The FHI survey gives it a lot of weight, but after reading the work of the Foresight Institute and Center for Responsible Nanotechnology (among others) from the last few decades since Drexler’s books, I am not very impressed with the magnitude of the x-risk here or the existence of distinctive high-leverage ways to improve outcome around the area, and the Foresight Institute continues to operate in any case (not to mention Eric Drexler visiting FHI this year).

Others disagree (Michael Vassar has worked with the CRN, and Eliezer often names molecular nanotechnology as the x-risk he would move to focus on if he knew that AI was impossible), but that’s my take.

Malthusian upload scenario

This is AI risk. Brain emulations are artificial intelligence by standard definitions, and in articles like Chalmers’ “The Singularity: a Philosophical Analysis.”

highly destructive war

It’s hard to destroy all life with a war not involving AI, or the biotech/nanotech mentioned above. The nuclear winter experts have told me that they think x-risk from a global nuclear war is very unlikely conditional on such a war happening, and it doesn’t seem that likely.

bad memes/philosophies spreading among humans or posthumans and overriding our values

There are already massive, massive, massive investments in tug-of-war over politics, norms, and values today. Shaping the conditions or timelines for game-changing technologies looks more promising to me than adding a few more voices to those fights. On the other hand, Eliezer has some hopes for education in rationality and critical thinking growing contagiously to shift some of those balances (not as a primary impact, and I am more skeptical). Posthuman value evolution does seem to sensibly fall under “AI risk,” and shaping the development and deployment of technologies for posthumanity seems like a leveraged way to affect that.

upload singleton ossifying into a suboptimal form compared to the kind of superintelligence that our universe could support

AI risk again.

(Are there any doomsday cults that say “doom is probably coming, we’re not sure how but here are some likely possibilities”?)

Probably some groups with a prophecy of upcoming doom, looking to every thing in the news as a possible manifestation.
- endoself 12 Apr 2012 2:57 UTC
  7 points
  Parent
  Are you including just the extinction of humanity in your definition of x-risk in this comment or are you also counting scenarios resulting in a drastic loss of technological capability?
  - CarlShulman 12 Apr 2012 3:00 UTC
    6 points
    Parent
    I expect losses of technological capability to be recovered with high probability.
    What links here?
    JonahS's comment on Common sense as a prior by Nick_Beckstead (12 Aug 2013 17:30 UTC; 2 points)
    - JoshuaZ 12 Apr 2012 3:30 UTC
      10 points
      Parent
      Why? This is highly non-obvious. To reach our current technological level, we had to use a lot of non-renewable resources. There’s still a lot of coal and oil left, but the remaining coal and oil is harder to reach and much more technologically difficult to reliably use. That trend will only continue. It isn’t obvious that if something set the tech level back to say 1600 that we’d have the resources to return to our current technology level.
      - CarlShulman 12 Apr 2012 3:43 UTC
        15 points
        Parent
        It’s been discussed repeatedly here on Less Wrong, and in many other places. The weight of expert opinion is on recovery, and I think the evidence is strong. Most resources are more accessible in ruined cities than they were in the ground, and more expensive fossil fuels can be substituted for by biomass, hydropower, efficiency, and so forth. It looks like there was a lot of slack in human development, e.g. animal and plant breeding is still delivering good returns after many centuries, humans have been adapting to civilization over the last thousands of years and would continue to become better adapted with a long period of low-fossil fuel near-industrial technology. And for many catastrophes knowledge from the previous civilization would be available to future generations.
        JoshuaZ 12 Apr 2012 13:25 UTC
        10 points
        Parent
        
        It’s been discussed repeatedly here on Less Wrong, and in many other places. The weight of expert opinion is on recovery
        
        Can you give sources for this? I’m particularly interested in the claim about expert opinion, since there doesn’t seem to be much discussion in the literature of this. Bostrom has mentioned it, but hasn’t come to any detailed conclusion. I’m not aware of anyone else discussing it.
        
        Most resources are more accessible in ruined cities than they were in the ground
        
        Right. This bit has been discussed on LW before in the context of many raw metals. The particularly good example is aluminum which is resource intensive and technically difficult to refine, but is easy to use once one has a refined a bit. That’s been discussed before, and looking around for such discussion I see that you and I discussed that here, but didn’t discuss the power issue in general.
        
        I think you are being optimistic about power. Hydropower and biomass while they can exist with minimal technology (and in fact, the first US commercial power plant outside New York was hydroelectric), they both have severe limitations as power methods. Hydroelectric power can only be placed in limited areas, and large-scale grids are infrastructurally difficult and require a lot of technical coordination and know-how . That’s why the US grids were separate little grids until pretty late. And using hydroelectric power would further restrict the locations that power can be produced, leading to much more severe inefficiences in the grid (due to long-term power transmission and the like). There’s a recent good book, Maggie Koerth Baker’s “Before the Lights Go Out”. , which discusses the difficulties and complexities in electric grids which also discusses in detail the historical problems with running grids. They are often underestimated.
        
        Similarly, direct biomass is not generally as energy dense as coal or oil. You can’t easily use biomass to power trains or airplanes. The technology to make synthetic oil was developed in the 1940s but it is inefficient, technically difficult, and requires a lot of infrastructure.
        
        I also think you are overestimating how much can be done with efficiency at a low tech level. Many of the technologies that can be made more efficient (such as lightbulbs) require a fair bit of technical know-how to use the more efficient version. Thus for example, while fluorescent lights are not much more technically difficult than incandescents, CFLs’ are much more technically difficult.
        
        And efficiency bites you a bit in another direction as well: If your technology is efficient enough, then you don’t have as much local demand on the grid, and you don’t get the benefits of the economies of scale that you get. This was historically a problem even when incandescent light-bulbs were in use- in the first forty years of electrificiation, the vast majority of electric companies failed.
        
        It looks like there was a lot of slack in human development, e.g. animal and plant breeding is still delivering good returns after many centuries
        
        We’re using much more careful and systematic methods of breeding now, and the returns are clearly diminishing- we’re not domesticating new crops, just making them marginally more efficient. It is only large returns because the same plants and animals are in such widespread use.
        
        And for many catastrophes knowledge from the previous civilization would be available to future generations.
        
        This is true for some catastrophes but not all, and I’m not at all sure it will be true for most.. Most humans have minimal technical know-how beyond their own narrow areas. I’m curious to hear more about how you reach this conclusion.
        satt 16 Apr 2012 21:06 UTC
        4 points
        Parent
        This may be worth expanding into a discussion post; I can’t remember any top-level posts devoted to this topic, and I reckon it’s important enough to warrant at least one. Your line of argument seems more plausible to me than CarlShulman’s (although that might change if CS can point to specific experts and arguments for why a technological reset could be overcome).
        Tyrrell_McAllister 12 Apr 2012 16:23 UTC
        3 points
        Parent
        
        Thus for example, while fluorescent lights are not much more technically difficult than incandescents, are much more technically difficult.
        
        Is there a typo in this sentence?
        JoshuaZ 12 Apr 2012 17:07 UTC
        1 point
        Parent
        Yes. Intended to be something like:
        
        Thus for example, while fluorescent lights are not much more technically difficult than incandescents, are much more technically difficult if one wants them to be cheaper and more efficient than incandescents.
    - A1987dM 12 Apr 2012 12:02 UTC
      5 points
      Parent
      On what timescale?
      
      I find the focus on x-risks as defined by Bostrom (those from which Earth-originating intelligent life will never, ever recover) way too narrow. A situation in which 99% of humanity dies and the rest reverts to hunting and gathering for a few millennia before recovering wouldn’t look much brighter than that—let alone one in which humanity goes extinct but in (say) a hundred million years the descendants of (say) elephants create a new civilization. In particular, I can’t see why we would prefer the latter to (say) a civilization emerging on Alpha Centauri—so per the principle of charity I’ll just pretend that instead of “Earth-originating intelligent life” he had said “descendants of present-day humans”.
      - loup-vaillant 12 Apr 2012 22:51 UTC
        3 points
        Parent
        It depends on what you value. I see 3 situations:
        
        Early Singularity. Everyone currently living is saved.
        Late Singularity. Nearly everyone currently living dies anyway.
        Very late Singularity, or “Semi-crush”. everyone currently living dies, and most of our yet to be born descendants (up to the second renaissance) will die as well. There is a point however were everyone is saved.
        Crush. Everyone will die, now and for ever. Plus, humanity dies with our sun.
        
        If you most value those currently living, that’s right, it doesn’t make much difference. But if you care about the future of humanity itself, a Very Late Singularity isn’t such a disaster.
        A1987dM 16 Apr 2012 16:33 UTC
        3 points
        Parent
        Now that I think about it, I care both about those currently living and about humanity itself, but with a small but non-zero discount rate (of the order of the reciprocal of the time humanity has existed so far). Also, I value humanity not only genetically but also memetically, so having people with human genome but Palaeolithic technocultural level surviving would be only slightly better for me than no-one surviving at all.
- Wei Dai 12 Apr 2012 0:11 UTC
  5 points
  Parent
  Perhaps it’s mainly a matter of perceptions, where “AI risk” typically brings to mind a particular doomsday scenario, instead of a spread of possibilities that includes posthuman value drift, which is also not helped by the fact that around here we talk much more about UFAI going FOOM than the other scenarios. Given this, do you think we should perhaps favor phrases like “Singularity-related risks and opportunities” where appropriate?
  - CarlShulman 12 Apr 2012 0:24 UTC
    12 points
    Parent
    I have the opposite perception, that “Singularity” is worse than “artificial intelligence.” If you want to avoid talking about FOOM, “Singularity” has more connotation of that than AI in my perception.
    
    I’m also not sure exactly what you mean by the “single scenario” getting privileged, or where you would draw the lines. In the Yudkowsky-Hanson debate and elsewhere Eliezer talked about many separate posthuman AIs coordinating to divvy up the universe without giving humanity or humane values a share, about monocultures of seemingly separate AIs with shared values derived from a common ancestor, and so forth. Whole brain emulations coming first, which then invent AIs that race ahead of the WBEs were discussed, and so forth.
    - Wei Dai 12 Apr 2012 0:57 UTC
      7 points
      Parent
      
      I have the opposite perception, that “Singularity” is worse than “artificial intelligence.”
      
      I see… I’m not sure what to suggest then. Anyone else have ideas?
      
      I’m also not sure exactly what you mean by the “single scenario” getting privileged, or where you would draw the lines.
      
      I think the scenario that “AI risk” tends to bring to mind is a de novo or brain-inspired AGI (excluding uploads) rapidly destroying human civilization. Here are a couple of recent posts along these lines and using the phrase “AI risk”.
      
      utilitymonster’s What is the best compact formalization of the argument for AI risk from fast takeoff?
      XiXiDu’s A Primer On Risks From AI
      ETA: See also lukeprog’s Facing the Singularity, which talks about this AI risk and none of the other ones you consider to be “AI risk”
      - steven0461 12 Apr 2012 1:04 UTC
        1 point
        Parent
        “Posthumanity” or “posthuman intelligence” or something of the sort might be an accurate summary of the class of events you have in mind, but it sounds a lot less respectable than “AI”. (Though maybe not less respectable than “Singularity”?)
    - Wei Dai 13 Apr 2012 20:37 UTC
      1 point
      Parent
      How about “Threats and Opportunities Associated With Profound Sociotechnological Change”, and maybe shortened to “future-tech threats and opportunities” in informal use?
- Wei Dai 12 Apr 2012 2:40 UTC
  4 points
  Parent
  Apparently it’s also common to not include uploads in the definition of AI. For example, here’s Eliezer:
  Perhaps we would rather take some other route than AI to smarter-than-human intelligence
  
  say, augment humans instead? To pick one extreme example, suppose the one says: The prospect of AI makes me nervous. I would rather that, before any AI is developed, individual humans are scanned into computers, neuron by neuron, and then upgraded, slowly but surely, until they are super-smart; and that is the ground on which humanity should confront the challenge of superintelligence.
  - CarlShulman 12 Apr 2012 2:44 UTC
    9 points
    Parent
    Yeah, there’s a distinction between things targeting a broad audience, where people describe WBE as a form of AI, versus some “inside baseball” talk in which it is used to contrast against WBE.
    - Wei Dai 12 Apr 2012 3:20 UTC
      5 points
      Parent
      That paper was written for the book “Global Catastrophic Risks” which I assume is aimed at a fairly general audience. Also, looking at the table of contents for that book, Eliezer’s chapter was the only one talking about AI risks, and he didn’t mention the three listed in my post that you consider to be AI risks.
      
      Do you think I’ve given enough evidence to support the position that many people, when they say or hear “AI risk”, is either explicitly thinking of something narrower than your definition of “AI risk”, or have not explicitly considered how to define “AI” but is still thinking of a fairly narrow range of scenarios?
      
      Besides that, can you see my point that an outsider/newcomer who looks at the public materials put out by SI (such as Eliezer’s paper and Luke’s Facing the Singularity website) and typical discussions on LW would conclude that we’re focused on a fairly narrow range of scenarios, which we call “AI risk”?
      - CarlShulman 12 Apr 2012 3:35 UTC
        1 point
        Parent
        
        explicitly thinking of something narrower than your definition of “AI risk”, or have not explicitly considered how to define “AI” but is still thinking of a fairly narrow range of scenarios?
        
        Yes.
- Dmytry 12 Apr 2012 4:20 UTC
  3 points
  Parent
  Seems like a prime example of where to apply rationality: what are the consequences to trying to work on AI risk right now? Versus on something else? Does AI risk work have good payoff?
  
  What’s of the historical cases? The one example I know of is this: http://www.fas.org/sgp/othergov/doe/lanl/docs1/00329010.pdf (thermonuclear ignition of atmosphere scenario). Can a bunch of people with little physics related expertise do something about such risks >10 years before? Beyond the usual anti war effort? Bill Gates will work on AI risk when it becomes clear what to do about it.
  - Wei Dai 12 Apr 2012 8:13 UTC
    2 points
    Parent
    
    Can a bunch of people with little physics related expertise do something about such risks >10 years before?
    
    Have you seen Singularity and Friendly AI in the dominant AI textbook?
    - Dmytry 12 Apr 2012 8:42 UTC
      1 point
      Parent
      I’m kind of dubious that you needed ‘beware of destroying mankind’ in a physics textbook to get Teller to check if nuke can cause thermonuclear ignition in atmosphere or seawater, but if it is there, I guess it won’t hurt.
      - Wei Dai 12 Apr 2012 9:18 UTC
        11 points
        Parent
        Here’s another reason why I don’t like “AI risk”: it brings to mind analogies like physics catastrophes or astronomical disasters, and lets AI researchers think that their work is ok as long as they have little chance of immediately destroying Earth. But the real problem is how do we build or become a superintelligence that shares our values, and given this seems very difficult, any progress that doesn’t contribute to the solution but brings forward the date by which we must solve it (or be stuck with something very suboptimal even if it doesn’t kill us) is bad, and this includes AI progress that is not immediately dangerous.
        
        ETA: I expanded this comment into a post here.
        What links here?
        Reframing the Problem of AI Progress by Wei Dai (12 Apr 2012 19:31 UTC; 32 points)
        Dmytry 12 Apr 2012 9:27 UTC
        1 point
        Parent
        Well, there’s this implied assumption that super-intelligence that ‘does not share our values’ shares our domain of definition of the values. I can make a fairly intelligent proof generator, far beyond human capability if given enough CPU time; it won’t share any values with me, not even the domain of applicability; the lack of shared values with it is so profound as to make it not do anything whatsoever in the ‘real world’ that I am concerned with. Even if it was meta—strategic to the point of potential for e.g. search for ways to hack into a mainframe to gain extra resources to do the task ‘sooner’ by wallclock time, it seems very dubious that by mere accident it will have proper symbol grounding, won’t wirelead (i.e. would privilege the solutions that don’t involve just stopping said clock), etc etc. Same goes for other practical AIs, even the evil ones that would e.g. try to take over internet.
        What links here?
        Wei Dai's comment on Thoughts on the Singularity Institute (SI) by HoldenKarnofsky (11 May 2012 7:39 UTC; 35 points)
        Wei Dai 12 Apr 2012 9:30 UTC
        6 points
        Parent
        You’re still falling into the same trap, thinking that your work is ok as long as it doesn’t immediately destroy the Earth. What if someone takes your proof generator design, and uses the ideas to build something that does affect the real world?
        Dmytry 12 Apr 2012 9:44 UTC
        1 point
        Parent
        
        You’re still falling into the same trap, thinking that your work is ok as long as it doesn’t immediately destroy the Earth. What if someone takes your proof generator design, and uses the ideas to build something that does affect the real world?
        
        Well let’s say in 2022 we have a bunch of tools along the lines of automatic problem solving, unburdened by their own will (not because they were so designed but by simple omission of immense counter productive effort). Someone with a bad idea comes around, downloads some open source software, cobbles together some self propelling ‘thing’ that is ‘vastly superhuman’ circa 2012. Keep in mind that we still have our tools that make us ‘vastly superhuman’ circa 2012 , and i frankly don’t see how ‘automatic will’, for lack of better term, is contributing anything here that would make the fully automated system competitive.
        What links here?
        Rain's comment on Thoughts on the Singularity Institute (SI) by HoldenKarnofsky (10 May 2012 20:47 UTC; 2 points)
        Wei Dai 12 Apr 2012 10:18 UTC
        8 points
        Parent
        Well, one thing the self-willed superintelligent AI could do is read your writings, form a model of you, and figure out a string of arguments designed to persuade you to give up your own goals in favor of its goals (or just trick you into doing things that further its goals without realizing it). (Or another human with superintelligent tools could do this as well.) Can you ask your “automatic problem solving tools” to solve the problem of defending against this, while not freezing your mind so that you can no longer make genuine moral/philosophical progress? If you can do this, then you’ve pretty much already solved the FAI problem, and you might as well ask the “tools” to tell you how to build an FAI.
        XiXiDu 12 Apr 2012 11:10 UTC
        −1 points
        Parent
        
        Well, one thing the self-willed superintelligent AI could do is read your writings, form a model of you, and figure out a string of arguments designed to persuade you to give up your own goals in favor of its goals...
        
        Does agency enable the AI to do so? If not, then why wouldn’t a human being not be able to do the same by using the AI in tool mode?
        
        Can you ask your “automatic problem solving tools” to solve the problem of defending against this...
        
        Just make it list equally convincing counter-arguments.
        Wei Dai 12 Apr 2012 20:34 UTC
        3 points
        Parent
        
        Does agency enable the AI to do so? If not, then why wouldn’t a human being not be able to do the same by using the AI in tool mode?
        
        Yeah, I realized this while writing the comment: “(Or another human with superintelligent tools could do this as well.)” So this isn’t a risk with self-willed AI per se. But note this actually makes my original point stronger, since I was arguing against the idea that progress on AI is safe as long as it doesn’t have a “will” to act in the real world.
        
        Just make it list equally convincing counter-arguments.
        
        So every time you look at a (future equivalent of) website or email, you ask your tool to list equally convincing counter-arguments to whatever you’re looking at? What does “equally convincing” mean? An argument that exactly counteracts the one that you’re reading, leaving your mind unchanged?
        Expand this thread
        XiXiDu 13 Apr 2012 9:57 UTC
        1 point
        Parent
        
        So every time you look at a (future equivalent of) website or email, you ask your tool to list equally convincing counter-arguments to whatever you’re looking at?
        
        Sure, why not? I think IBM is actually planning to do this with IBM Watson. Once mobile phones become fast enough you can receive constant feedback about ideas and arguments you encounter.
        
        For example, some commercial tells you that you can lose 10 pounds in 1 day by taking a pill. You then either ask your “IBM Oracle” or have it set up to give you automatic feedback. It will then tell you that there are no studies that indicate that something as advertised is possible and that it won’t be healthy anyway. Or something along those lines.
        
        I believe that in future it will be possible to augment everything with fact-check annotations.
        
        But that’s besides the point. The idea was that if you run the AI box experiment with Eliezer posing as malicious AI trying to convince the gatekeeper to let it out of the box, and at the same time as a question answering tool using the same algorithms as the AI, then I don’t think someone would let him out of the box. He would basically have to destroy his own arguments by giving unbiased answers about the trustworthiness of the boxed agent and possible consequences of letting it out of the box. At the very best the AI in agent mode would have to contradict the tool mode version and thereby reveal that it is dishonest and not trustworthy.
        khafra 11 May 2012 14:58 UTC
        1 point
        Parent
        
        So every time you look at a (future equivalent of) website or email, you ask your tool to list equally convincing counter-arguments to whatever you’re looking at?
        
        Sure, why not?
        
        When I’m feeling down and my mom sends me an email trying to cheer me up, that’ll be a bit of a bummer.
        Dmytry 12 Apr 2012 11:31 UTC
        2 points
        Parent
        Yep. Majorly awesome scenario degrades into ads vs adblock when you consider everything in the future not just the self willed robot. Matter of fact, a lot of work is put into constructing convincing strings of audio and visual stimuli, and into ignoring those strings.
        Expand this thread
        David_Gerard 12 Apr 2012 12:12 UTC
        3 points
        Parent
        Superstimuli and the Collapse of Western Civilization. Using such skills to manipulate other humans appears to be what we grew intelligence for, of course. As I note, western civilisation is already basically made of the most virulent toxic memes we can come up with. In the noble causes of selling toothpaste and car insurance and, of course, getting laid. It seems to be what we do now we’ve more or less solved the food and shelter problems.
        TheOtherDave 12 Apr 2012 14:12 UTC
        0 points
        Parent
        They’d probably have to be more convincing, since convincing a human being out of a position they already hold is usually a more difficult task than convincing them to hold the position in the first place.
        Expand this thread
        XiXiDu 12 Apr 2012 15:13 UTC
        1 point
        Parent
        
        They’d probably have to be more convincing, since convincing a human being out of a position they already hold is usually a more difficult task than convincing them to hold the position in the first place.
        
        If I have a superhuman answering machine on one side. A tool that just lists a number of answers to my query, just like a superhuman Google. And on the other side I have the same tool in agent mode. Then why would I be more convinced by the agent mode output?
        
        An agent has an incentive to trick me. While the same algorithm, minus the agency module, will just output unbiased answers to my queries.
        
        If the answers between the tool and agent mode differ, then I naturally believe the tool mode output.
        
        If for example the agent mode was going to drivel something about acausal trade and the tool mode would just output some post by Eliezer Yudkowsky explaining why I shouldn’t let the AI out of the box, then how could the agent mode possible be more convincing? Especially since putting the answering algorithm into agent mode shouldn’t improve the answers.
        TheOtherDave 12 Apr 2012 15:41 UTC
        0 points
        Parent
        
        why would I be more convinced by the agent mode output?
        
        You wouldn’t, necessarily. Nor did I suggest that you would.
        
        I also agree that if (AI in “agent mode”) does not have any advantages over (“tool mode” plus human agent), then there’s no reason to expect its output to be superior, though that’s completely tangential to the comment you replied to.
        
        That said, it’s not clear to me that (AI in “agent mode”) necessarily lacks advantages over (“tool mode” plus human agent).
        TheOtherDave 12 Apr 2012 15:28 UTC
        0 points
        Parent
        
        why would I be more convinced by the agent mode output?
        
        You wouldn’t, necessarily. Nor did I suggest that you would.
        
        I also agree that if (AI in “agent mode”) does not have any advantages over (“tool mode” plus human agent), then there’s no reason to expect its output to be superior, though that’s completely tangential to the comment you replied to.
        
        That said, it’s not clear to me that (AI in “agent mode”) necessarily lacks advantages over (“tool mode” plus human agent).
        XiXiDu 12 Apr 2012 16:30 UTC
        0 points
        Parent
        
        ...though that’s completely tangential to the comment you replied to.
        
        I don’t think that anyone with the slightest idea that an AI in agent mode could have malicious intentions, and therefore give biased answers, wouldn’t be as easily swayed by counter-arguments made by a similarly capable algorithm.
        
        I mean, we shouldn’t assume an idiot gatekeeper who never heard of anything we’re talking about here. So the idea that an AI in agent mode could brainwash someone to the extent that it afterwards takes even stronger arguments to undo it seems rather far-fetched (ETA What’s it supposed to say? That the tool uses the same algorithms as itself but is somehow wrong in claiming that the AI in agent mode tries to brainwash the gatekeeper?).
        
        The idea is that given a sufficiently strong AI in tool mode, it might be possible to counter any attempt to trick a gatekeeper. And in the case that the tool mode agrees, then it probably is a good idea to let the AI out of the box. Although anyone familiar with the scenario would probably rather assume a systematic error elsewhere, e.g. a misinterpretation of one’s questions by the AI in tool mode.
        TheOtherDave 12 Apr 2012 16:47 UTC
        0 points
        Parent
        
        I don’t think that anyone with the slightest idea that an AI in agent mode could have malicious intentions, and therefore give biased answers, wouldn’t be as easily swayed by counter-arguments made by a similarly capable algorithm.
        
        Ah, I see. Sure, OK, that’s apposite. Thanks for clarifying that.
        
        I disagree with your prediction.
        XiXiDu 12 Apr 2012 11:06 UTC
        3 points
        Parent
        
        Keep in mind that we still have our tools that make us ‘vastly superhuman’ circa 2012 , and i frankly don’t see how ‘automatic will’, for lack of better term, is contributing anything here that would make the fully automated system competitive.
        
        This is actually one of Greg Egan’s major objections. That superhuman tools come first and that artificial agency won’t make those tools competitive against augmented humans. Further, you can’t apply any work done to ensure that an artificial agents is friendly to augmented humans.
- Turgurth 12 Apr 2012 5:44 UTC
  2 points
  Parent
  I have a few questions, and I apologize if these are too basic:
  
  1) How concerned is SI with existential risks vs. how concerned is SI with catastrophic risks?
  
  2) If SI is solely concerned with x-risks, do I assume correctly that you also think about how cat. risks can relate to x-risks (certain cat. risks might raise or lower the likelihood of other cat. risks, certain cat. risks might raise or lower the likelihood of certain x-risks, etc.)? It must be hard avoiding the conjunction fallacy! Or is this sort of thing more what the FHI does?
  
  3) Is there much tension in SI thinking between achieving FAI as quickly as possible (to head off other x-risks and cat. risks) vs. achieving FAI as safely as possible (to head off UFAI), or does one of these goals occupy signficantly more of your attention and activities?
  
  Edited to add: thanks for responding!
  - CarlShulman 12 Apr 2012 6:03 UTC
    8 points
    Parent
    
    How concerned is SI with existential risks vs. how concerned is SI with catastrophic risks?
    
    Different people have different views. For myself, I care more about existential risks than catastrophic risks, but not overwhelmingly so. A global catastrophe would kill me and my loved ones just as dead. So from the standpoint of coordinating around mutually beneficial policies, or “morality as cooperation” I care a lot about catastrophic risk affecting current and immediately succeeding generations. However, when I take a “disinterested altruism” point of view x-risk looms large: I would rather bring 100 trillion fantastic lives into being than improve the quality of life of a single malaria patient.
    
    If SI is solely concerned with x-risks, do I assume correctly that you also think about how cat. risks can relate to x-risks
    
    Yes.
    
    Or is this sort of thing more what the FHI does?
    
    They spend more time on it, relatively speaking.
    
    FAI as quickly as possible (to head off other x-risks and cat. risks) vs. achieving FAI as safely as possible (to head off UFAI)
    
    Given that powerful AI technologies are achievable in the medium to long term, UFAI would seem to me be a rather large share of the x-risk, and still a big share of the catastrophic risk, so that speedups are easily outweighed by safety gains.
    - multifoliaterose 14 Apr 2012 0:07 UTC
      2 points
      Parent
      
      However, when I take a “disinterested altruism” point of view x-risk looms large: I would rather bring 100 trillion fantastic lives into being than improve the quality of life of a single malaria patient.
      
      What’s your break even point for “bring 100 trillion fantastic lives into being with probability p” vs. “improve the quality of a single malaria patient” and why?
      - CarlShulman 14 Apr 2012 0:20 UTC
        0 points
        Parent
        It depends on the context (probability distribution over number and locations and types of lives), with various complications I didn’t want to get into in a short comment.
        
        Here’s a different way of phrasing things: if I could trade off probability p1 of increasing the income of everyone alive today (but not providing lasting benefits into the far future) to at least $1,000 per annum with basic Western medicine for control of infectious disease, against probability p2 of a great long-term posthuman future with colonization, I would prefer p2 even if it was many times smaller than p1. Note that those in absolute poverty are a minority of current people, a tiny minority of the people who have lived on Earth so far, their life expectancy is a large fraction of that of the rich, and so forth.
- steven0461 12 Apr 2012 0:21 UTC
  1 point
  Parent
  
  Nanotech x-risk would seem to come out of mass-producing weapons that kill survivors of an all out war (which leaves neither side standing), like systems that could replicate in the wild and destroy the niche of primitive humans, really numerous robotic weapons that would hunt down survivors over time, and such like.
  
  What about takeover by an undesirable singleton? Also, if nanotechnology enables AI or uploads, that’s an AI risk, but it might still involve unique considerations we don’t usually think to talk about. The opportunities to reduce risk here have to be very small to justify LessWrong’s ignoring the topic almost entirely, as it seems to me that it has. The site may well have low-hanging conceptual insights to offer that haven’t been covered by CRN or Foresight.
  - CarlShulman 12 Apr 2012 0:34 UTC
    10 points
    Parent
    
    to justify LessWrong’s ignoring the topic
    
    That’s a much lower standard than “should Luke make this a focus when trading breadth vs speed in making his document”. If people get enthused about that, they’re welcome to. I’ve probably put 50-300 hours (depending on how inclusive a criterion I use for relevant hours) into the topic, and saw diminishing returns. If I overlap with Eric Drexler or such folk at a venue I would inquire, and I would read a novel contribution, but I’m not going to be putting much into it given my alternatives soon.
    - steven0461 12 Apr 2012 1:09 UTC
      5 points
      Parent
      I agree that it’s a lower standard. I didn’t mean to endorse Wei’s claims in the original post, certainly not based on nanotech alone. If you don’t personally think it’s worth more of your time to pay attention to nanotech, I’m sure you’re right, but it still seems like a collective failure of attention that we haven’t talked about it at all. You’d expect some people to have a pre-existing interest. If you ever think it’s worth it to further describe the conclusions of those 50-300 hours, I’d certainly be curious.
      - CarlShulman 12 Apr 2012 1:12 UTC
        3 points
        Parent
        I’ll keep that in mind.