Veedrac comments on “Diamondoid bacteria” nanobots: deadly threat or dead-end? A nanotech investigation

Veedrac 30 Sep 2023 21:36 UTC
47 points
18
I’m not sure how to put this, but while this post is framed as a response to AI risk concerns, those concerns are almost entirely ignored in favor of looking at how plausible it is for near-term human research to achieve it, and only at the end is it connected back to AI risk via a brief aside whose crux is basically that you don’t think Yudkowsky-style ASI will exist.

I like a lot of the discussion if I frame it in my head to be about what it is actually arguing for. Taking it as given, it seems instead broadly non-sequiter, as the evidence given basically doesn’t relate to resolving the disagreement.
- titotal 1 Oct 2023 15:52 UTC
  21 points
  15
  Parent
  At no point did I ever claim that this was a conclusive debunking of AI risk as a whole, only an investigation into one specific method proposed by Yudkowksy as an AI death dealer.
  In my post I have explained what DMS is, why it was proposed as a technology, how far along the research went, the technical challenges faced in it’s construction, some observations of how nanotech research works, the current state of nanotech research, what near-term speedups can be expected from machine learning, and given my own best guess on whether an AGI could pull off inventing MNT in a short timeframe, based on what was learned.
  This is only “broadly non-sequiter” if you think that none of that information is relevant for assessing the feasibility of diamondoid bacteria AI weapons, which strikes me as somewhat ridiculous.
  - Veedrac 1 Oct 2023 18:24 UTC
    3 points
    1
    Parent
    Rather than focusing on where I disagree with this, I want to emphasize the part where I said that I liked a lot of the discussion if I frame it in my head differently. I think if you opened the Introduction section with the second paragraph of this reply (“In my post I have explained”), rather than first quoting Yudkowsky, you’d set the right expectations going into it. The points you raise are genuinely interesting, and tons of people have worldviews that this would be much more convincing to than Yudkowsky’s.
    What links here?
    Veedrac's comment on “Diamondoid bacteria” nanobots: deadly threat or dead-end? A nanotech investigation by titotal (1 Oct 2023 18:55 UTC; 2 points)
- 1a3orn 1 Oct 2023 13:04 UTC
  13 points
  3
  Parent
  What would qualify as an evidence against how ASI can do a thing, apart from pointing out the actual physical difficulties in doing the thing?
  - Veedrac 1 Oct 2023 18:09 UTC
    7 points
    1
    Parent
    
    apart from pointing out the actual physical difficulties in doing the thing
    
    This excludes most of the potential good arguments! If you can show that large areas of the solution space seem physically unrealizable, that’s an argument that potentially generalizes to ASI. For example, I think people can suggest good limits on how ASI could and couldn’t traverse the galaxy, and trivially rule out threats like ‘the AI crashes the moon into Earth’, because of physical argument.
    
    To hypothesize an argument of this sort that might be persuasive, at least to people able to verify such claims: ‘Synthesis of these chemicals is not energetically feasible at these scales because these bonds take $X energy to form, but it’s only feasible to store $Y energy in available bonds. This limits you to a very narrow set of reactions which seems unable to produce the desired state. Thus larger devices are required, absent construction under an external power source.’ I think a similar argument could plausibly exist around object stickiness, though I don’t have the chemistry knowledge to give a good framing for how that might look.
    
    There aren’t as many arguments once we exclude physical arguments. If you wanted to argue that it was plausibly physically realizable but that strong ASI wouldn’t figure it out, I suppose some in-principle argument that it involves solving a computationally intractable challenge in leu of experiment might work, though that seems hard to argue in reality.
    
    It’s generally hard to use weaker claims to limit far ASI, because, being by definition qualitatively and quantitatively smarter than us, it can reason about things in ways that we can’t. I’m happy to think there might exist important, practically-solvable-in-principle tasks that an ASI fails to solve, but it seems implausible for me to know ahead of time which tasks those are.
- Portia 1 Oct 2023 18:28 UTC
  3 points
  0
  Parent
  I think the text is mostly focussed on the problems humans have run into when building this stuff, because these are known and hence our only solid empirical detailed basis, while the problems AI would run into when building this stuff are entirely hypothetical.
  It then makes a reasonable argument that AI probably won’t be able to circumvent these problems, because higher intelligence and speed alone would not plausibly fix them, and in fact, a plausible fix might have to be slow, human-mediated, and practical.
  One can disagree with that conclusion, but as for the approach, what alternative would you propose when trying to judge AI risk?
  - Veedrac 1 Oct 2023 18:55 UTC
    2 points
    0
    Parent
    I think I implicitly answered you elsewhere, though I’ll add a more literal response to your question here.
    
    On a personal level, none of this is relevant to AI risk. Yudkowsky’s interest in it seems like more of a byproduct of his reading choices when he was young and impressionable than anything else, which is not reading I shared. Neither he nor I think this is necessary for xrisk scenarios, with me probably being on the more skeptical side, and me believing more in practical impediments that strongly encourage doing the simple things that work, eg. conventional biotech.
    
    Due to this not being a crux and not having the same personal draw towards discussing it, I basically don’t think about this when I think about modelling AI risk scenarios. I think about it when it comes up because it’s technically interesting. If someone is reasoning about this because they do think it’s a crux for their AI risk scenarios, and they came to me for advice, I’d suggest testing that crux before I suggested being more clever about de novo nanotech arguments.
- faul_sname 1 Oct 2023 0:45 UTC
  3 points
  0
  Parent
  My impression is that the nanotech is a load bearing part of the “AI might kill all humans with no warning and no signs ofpreparation” story—specifically, “a sufficiently smart ASI could quickly build itself more computing substrate without having to deal with building and maintaining global supply chains, and doing so would be the optimal course of action” seems like the sort of thing that’s probably true if it’s possible to build self- replicating nanofactories without requiring a bunch of slow, expensive, serial real-world operations to get the first one built and debugged, and unlikely to be true if not.
  
  That’s not to say human civilisation is invulnerable, just that “easy” nanotech is a central part of the “everyone dies with no warning and the AI takes over the light cone uncontested” story.
  - Veedrac 1 Oct 2023 1:41 UTC
    6 points
    3
    Parent
    I was claiming that titotal’s post doesn’t appear to give arguments that directly address whether or not Yudkowsky-style ASI can invent diamondoid nanotech. I don’t understand the relevance to my comment. I agree that if you find titotal’s argument persuasive then whether it is load bearing is relevant to AI risk concerns, but that’s not what my comment is about.
    
    FWIW Yudkowsky frequently says that this is not load bearing, and that much seems obviously true to me also.
    - adastra22 4 Mar 2024 8:32 UTC
      3 points
      2
      Parent
      .
      - Veedrac 4 Mar 2024 10:20 UTC
        2 points
        0
        Parent
        Could you quote or else clearly reference a specific argument from the post you found convincing on that topic?
        adastra22 4 Mar 2024 23:04 UTC
        2 points
        0
        Parent
        .
        Veedrac 5 Mar 2024 2:23 UTC
        5 points
        1
        Parent
        Well yes, nobody thinks that existing techniques suffice to build de-novo self-replicating nano machines, but that means it’s not very informative to comment on the fallibility of this or that package or the time complexity of some currently known best approach without grounding in the necessity of that approach.
        
        One has to argue instead based on the fundamental underlying shape of the problem, and saying accurate simulation is O(n⁷) is not particularly more informative to that than saying accurate protein folding is NP. I think if the claim is that you can’t make directionally informative predictions via simulation for things meaningfully larger than helium then one is taking the argument beyond where it can be validly applied. If the claim is not that, it would be good to hear it clearly stated.
        adastra22 5 Mar 2024 9:10 UTC
        4 points
        1
        Parent
        .
        Veedrac 5 Mar 2024 17:41 UTC
        2 points
        0
        Parent
        And what reason do you have for thinking it can’t be usefully approximated in some sufficiently productive domain, that wouldn’t also invalidly apply to protein folding? I think it’s not useful to just restate that there exist reasons you know of, I’m aiming to actually elicit those arguments here.
        adastra22 5 Mar 2024 21:36 UTC
        3 points
        0
        Parent
        .
        Veedrac 10 Mar 2024 2:36 UTC
        5 points
        0
        Parent
        Thanks, I appreciate the attempt to clarify. I do though think there’s some fundamental disagreement about what we’re arguing over here that’s making it less productive than it could be. For example,
        The fact that this has been an extremely active area of research for over 80 years with massive real-world implications, and we’re no closer to finding such a simplified heuristic.
        I think both:
        Lack of human progress doesn’t necessarily mean the problem is intrinsically unsolvable by advanced AI. Humans often take a bunch of time before proving things.
        It seems not at all the case that algorithmic progress isn’t happening, so it’s hardly a given that we’re no closer to a solution unless you first circularly assume that there’s no solution to arrive at.
        If you’re starting out with an argument that we’re not there yet, this makes me think more that there’s some fundamental disagreement about how we should reason about ASI, more than your belief being backed by a justification that would be convincing to me had only I succeeded at eliciting it. Claiming that a thing is hard is at most a reason not to rule out that it’s impossible. It’s not a reason on its own to believe that it is impossible.
        With regard to complexity,
        I failed to understand the specific difference with protein folding. Protein folding is NP-hard, which is significantly harder than O(n³).
        I failed to find the source for the claim that O(n³) or O(n⁴) are optimal. Actually I’m pretty confused how this is even a likely concept; surely if the O(n³) algorithm is widely useful then the O(n⁴) proof can’t be that strong of a bound on practical usefulness? So why is this not true of the O(n³) proof as well?
        It’s maybe true that protein folding is easier to computationally verify solutions to, but first, can you prove this, and second, on what basis are you claiming that existing knowledge is necessarily insufficient to develop better heuristics than the ones we already have? The claim doesn’t seem to complete to me.
        It’s magical thinking to assume that an AI will just one-shot this into existence.
        Please note that I’ve not been making the claim that ASI could necessarily solve this problem. I have been making the claim that the arguments in this post don’t usefully support the claim that it can’t. It is true that largely on priors I expect it should be able to, but my priors also aren’t particularly useful ones to this debate and I have tried to avoid making comments that are dependent on them.
        Expand this thread
        adastra22 10 Mar 2024 18:28 UTC
        10 points
        2
        Parent
        .
        Veedrac 10 Mar 2024 19:10 UTC
        5 points
        1
        Parent
        If you say “Indeed it’s provable that you can’t have a faster algorithm than those O(n^3) and O(n^4) approximations which cover all relevant edge cases accurately” I am quite likely to go on a digression where I try to figure out what proof you’re pointing at and why you think it’s a fundamental barrier, and it seems now that per a couple of your comments you don’t believe it’s a fundamental barrier, but at the same time it doesn’t feel like any position has been moved, so I’m left rather foggy about where progress has been made.
        I think it’s very useful that you say
        I’m not saying that AI can’t develop useful heuristic approximations for the simulation of gemstone-based nano-mechanical machinery operating in ultra-high vacuum. I’m saying that it can’t do so as a one-shot inference without any new experimental work
        since this seems like a narrower place to scope our conversation. I read this to mean:
        You don’t know of any in principle barrier to solving this problem,
        You believe the solution is underconstrained by available evidence.
        I find the second point hard to believe, and don’t really see anywhere you have evidenced it.
        As a maybe-relevant aside to that, wrt.
        You’re saying that AI could take the garbage and by mere application of thought turn it into something useful. That’s not in line with the actual history of the development of useful AI outputs.
        I think you’re talking of ‘mere application of thought’ like it’s not the distinguishing feature humanity has. I don’t care what’s ‘in line with the actual history’ of AI, I care what a literal superintelligence could do, and this includes a bunch of possibilities like:
        Making inhumanly close observation of all existing data,
        Noticing new, inhumanly-complex regularities in said data,
        Proving new simplifying regularities from theory,
        Inventing new algorithms for heuristic simulation,
        Finding restricted domains where easier regularities hold,
        Bifurcating problem space and operating over each plausible set,
        Sending an interesting email to a research lab to get choice high-ROI data.
        We can ignore the last one for this conversation. I still don’t understand why the others are deemed unreasonable ways of making progress on this task.
        I appreciated the comments on time complexity but am skipping it because I don’t expect at this point that it lies at the crux.
        adastra22 10 Mar 2024 19:39 UTC
        2 points
        1
        Parent
        .
        ryan_greenblatt 10 Mar 2024 21:46 UTC
        4 points
        2
        Parent
        One quick intuition pump: do you think a team of 10,000 of the smartest human engineers and scientists could do this if they had perfect photographic memory, were immortal, and could think for a billion years?
        
        To keep the situation analogous to an AI needing to do this quickly, we’ll suppose this team of humans is subject to the same restrictions on compute for simulations (e.g. over the entire billion years they only get 10^30 FLOP) and also can only run whatever experiments the AI would get to run (effectively no interesting experiments).
        
        I feel uncertain about whether this team would succeed, but it does seem pretty plausible that they would be able to succeed. Perhaps I think they’re 40% likely to succeed?
        
        Now, suppose the superintelligence is like this, but even more capable.
        
        See also That Alien Message
        
        Separately, I don’t think it’s very important to know what an extremely powerful superintelligence could do, because prior to the point where you have an extremely powerful superintelligence, humanity will already be obsoleted by weaker AIs. So, I think Yudkowsky’s arguments about nanotech are mostly unimportant for other reasons.
        
        But, if you did think “well, sure the AI might be arbitrarily smart, but if we don’t give it access to the nukes what can it even do to us?” then I think that there are many sources of concern and nanotech is certainly one of them.
        adastra22 11 Mar 2024 1:45 UTC
        4 points
        0
        Parent
        .
        ryan_greenblatt 11 Mar 2024 4:20 UTC
        4 points
        2
        Parent
        
        That’s a more comparable example.
        
        I don’t understand where your confidence is coming from here, but fair enough. It wasn’t clear to me if your take was more like “wildly, wildly superintelligent AI will be considerably weaker than a team of humans thinking for a billion years” or more like “literally impossible without either experiments or >>10^30 FLOP”.
        
        I generally have an intuition like “it’s really, really hard to rule out physically possible things out without very strong evidence, by default things have a reasonable chance of being possible (e.g. 50%) when sufficient intelligence is applied if they are physically possible”. It seems you don’t share this intuition, fair enough.
        
        (I feel like this applies for nearly all human inventions? Like if you had access to a huge amount of video of the world from 1900 and all written books that existed at this point, and had the affordances I described with a team of 10,000 people, 10^30 FLOP, and a billion years, it seems to me like there is a good chance you’d be able to one-shot reinvent ~all inventions of modern humanity (not doing everything in the same way, in many cases you’d massively over engineer to handle one-shot). Planes seem pretty easy? Rockets seem doable?)
        
        Please don’t use That Alien Message as an intuition pump.
        
        I think this is an ok, but not amazing intuition pump for what wildly, wildly superintelligent AI could be like.
        
        The best thing you can do is rid yourself of the notion that superhuman AI would have arbitrary capabilities
        
        I separately think it’s not very important to think about the abilities of wildly, wildly superintelligent AI for most purposes (as I noted in my comment). So I agree that imagining arbitrary capabilities is probablematic. (For some evidence that this isn’t post-hoc justification, see this post on which I’m an author.)
        
        PS: We’ve had AGI since 2017. That’d better be compatible with your world view if you want accurate predictions.
        
        Uhhhh, I’m not sure I agree with this as it doesn’t seem like nearly all jobs are easily fully automatable by AI. Perhaps you use a definition of AGI which is much weaker like “able to speak slightly coherant english (GPT-1?) and classify images”?
        adastra22 11 Mar 2024 5:55 UTC
        0 points
        −1
        Parent
        .
        What links here?
        Modern Transformers are AGI, and Human-Level by abramdemski (26 Mar 2024 17:46 UTC; 219 points)
        ryan_greenblatt 11 Mar 2024 6:14 UTC
        4 points
        0
        Parent
        
        then I suggest you taboo the term AGI
        
        FWIW, I do taboo this term and thus didn’t use it in this conversation until you introduced it.
        adastra22 11 Mar 2024 6:27 UTC
        1 point
        0
        Parent
        .
        ryan_greenblatt 11 Mar 2024 7:03 UTC
        3 points
        0
        Parent
        Definition in the OpenAI Charter:
        
        artificial general intelligence (AGI)—by which we mean highly autonomous systems that outperform humans at most economically valuable work
        
        A post on the topic by Richard (AGI = beats most human experts).
        ryan_greenblatt 11 Mar 2024 7:11 UTC
        2 points
        0
        Parent
        
        My assumption is that when people say AGI here they mean Bostrom’s ASI, and they got linked because Eliezer believed (and believes still?) that AGI will FOOM into ASI almost immediately, which it has not.
        
        In case this wasn’t clear from early discussion, I disagree with Eliezer on a number of topics, including takeoff speeds. In particular I disagree about the time from AI that is economically transformative to AI that is much, much more powerful.
        
        I think you’ll probably find it healthier and more productive to not think of LW as an amorphous collective and instead note that there are a variety of different people who post on the forum with a variety of different views. (I sometimes have made this mistake in the past and I find it healthy to clarify at least internally.)
        
        E.g. instead of saying “LW has bad views about X” say “a high fraction of people who comment on LW have bad views about X” or “a high fraction of karma votes seem to be from people with bad views about X”. Then, you should maybe double check the extent to which a given claim is actualy right : ). For instance, I don’t think almost immediate FOOM is the typical view on LW when aggregating by most metrics, a somewhat longer duration takeoff is now a more common view I think.
        ryan_greenblatt 11 Mar 2024 7:11 UTC
        2 points
        0
        Parent
        Also, I’m going to peace out of this discussion FYI.
        ryan_greenblatt 11 Mar 2024 6:09 UTC
        2 points
        0
        Parent
        
        By the way, where’s this number coming from? (10^30 FLOP) You keep repeating it.
        
        Extremely rough and slightly conservatively small ball park number for how many FLOP will be used to create powerful AIs. The idea being that this will represent roughly how many FLOP could plausibly be available at the time.
        
        GPT-4 is ~10^26 FLOP, I expect GPT-7 is maybe 10^30 FLOP.
        
        Perhaps this is a bit too much because the scheming AI will have access to far few FLOP than exist at the time, but I expect this isn’t cruxy, so I just did a vague guess.
        ryan_greenblatt 11 Mar 2024 6:07 UTC
        2 points
        0
        Parent
        
        Why? You’ve gone into circular logic here.
        
        I wasn’t trying to justify anything, just noting my stance.
        Veedrac 10 Mar 2024 20:28 UTC
        4 points
        0
        Parent
        I greatly appreciate the effort in this reply, but I think it’s increasingly unclear to me how to make efficient progress on our disagreements, so I’m going to hop.