So what terminology do you want to use to make this distinction then?
Some auto insurance companies use “collision” instead of “accident” for what transpired, to avoid unintended connotation, separately from assigning responsibility/fault. That part is based on following the letter of the law. In the AI case a better term might be “disaster”, which does not have the connotation of the term “accident”.
If someone deliberately misuses AI to kill lots of people, that’s a “disaster” too.
Sure is; it separates the description of what happened from assigning responsibility, which I assume is what the OP wanted.
Instead of “accident”, we could say “gross negligence” or “recklessness” for catastrophic risk from AI misalignment.
Seems to me that this is building in too much content / will have the wrong connotations. If an ML researcher hears about “recklessness risk”, they’re not unlikely to go “oh, well I don’t feel ‘reckless’ at my day job, so I’m off the hook”.
Locating the issue in the cognition of the developer is probably helpful in some contexts, but it has the disadvantage that (a) people will reflect on their cognition, fail to notice any “negligent-feeling thoughts”, and conclude that accident risk is low; and (b) it encourages people to take their eye off the ball, focusing on psychology (and arguments about whose psychology is X versus Y) rather than on properties of the AI itself.
“Accident risk” is maybe better just because it’s vaguer. The main problem I see with it isn’t “this sounds like it’s letting the developers off the hook” (since when do we assume that all accidents are faultless?). Rather, I think the problem with “accident” is that it sounds minor.
Accidentally breaking a plate is an “accident”. Accidentally destroying a universe is… something a bit worse than that.
Fair point.
If the issue with “accident” is that it sounds minor*, then one could say “catastrophic accident risk” or similar.
*I’m not fully bought into this as the main issue, but supposing that it is...
I really don’t think the distinction is meaningful or useful in almost any situation. I think if people want to make something like this distinction they should just be more clear about exactly what they are talking about.
How about the distinction between (A) “An AGI kills every human, and the people who turned on the AGI didn’t want that to happen” versus (B) “An AGI kills every human, and the people who turned on the AGI did want that to happen”?
I’m guessing that you’re going to say “That’s not a useful distinction because (B) is stupid. Obviously nobody is talking about (B)”. In which case, my response is “The things that are obvious to you and me are not necessarily obvious to people who are new to thinking carefully about AGI x-risk.”
…And in particular, normal people sometimes seem to have an extraordinarily strong prior that “when people are talking about x-risk, it must be (B) and not (A), because (A) is weird sci-fi stuff and (B) is a real thing that could happen”, even after the first 25 times that I insist that I’m really truly talking about (A).
So I do think drawing a distinction between (A) and (B) is a very useful thing to be able to do. What terminology would you suggest for that?
I think the misuse vs. accident dichotomy is clearer when you don’t focus exclusively on “AGI kills every human” risks. (E.g., global totalitarianism risks strike me as small but non-negligible if we solve the alignment problem. Larger are risks that fall short of totalitarianism but still involve non-morally-humble developers damaging humanity’s long-term potential.)
The dichotomy is really just “AGI does sufficiently bad stuff, and the developers intended this” versus “AGI does sufficiently bad stuff, and the developers didn’t intend this”. The terminology might be non-ideal, but the concepts themselves are very natural.
It’s basically the same concept as “conflict disaster” versus “mistake disaster”. If something falls into both categories to a significant extent (e.g., someone tries to become dictator but fails to solve alignment), then it goes in the “accident risk” bucket, because it doesn’t actually matter what you wanted to do with the AI if you’re completely unable to achieve that goal. The dynamics and outcome will end up looking basically the same as in other accidents.
By “intend” do you mean that they sought that outcome / selected for it?
Or merely that it was a known or predictable outcome of their behavior?
I think “unintentional” would already probably be a better term in most cases.
“Concrete Problems in AI Safety” used this distinction to make this point, and I think it was likely a useful simplification in that context. I generally think spelling it out is better, and I think people will pattern-match your concerns onto either “the sci-fi scenario where AI spontaneously becomes conscious, goes rogue, and pursues its own goals” or “boring old robustness problems” if you don’t invoke structural risk. I think structural risk plays a crucial role in the arguments, and even if you think things that look more like pure accidents are more likely, I think the structural risk story is more plausible to more people and a sufficient cause for concern.
RE (A): A known side-effect is not an accident.
A natural misconception lots of normies have is that the primary risks from AI come from bad actors using it explicitly to do evil things, rather than from bad actors being unable to align AIs at all, which causes Clippy to run wild. I would like to distinguish between these two scenarios, and accident vs. misuse risk is an obvious way to do that.