My reply to Paul at the time:
If a misaligned AI had 1/trillion “protecting the preferences of whatever weak agents happen to exist in the world”, why couldn’t it also have 1/trillion other vaguely human-like preferences, such as “enjoy watching the suffering of one’s enemies” or “enjoy exercising arbitrary power over others”?
From a purely selfish perspective, I think I might prefer that a misaligned AI kills everyone, and to take my chances with continuations of myself (my copies/simulations) elsewhere in the multiverse, rather than face whatever the sum-of-desires of the misaligned AI decides to do with humanity. (With the usual caveat that I’m very philosophically confused about how to think about all of this.)
And his response was basically that he had already acknowledged my concern in his OP:
I’m not talking about whether the AI has spite or other strong preferences that are incompatible with human survival; I’m engaging specifically with the claim that AI is likely to care so little one way or the other that it would prefer to just use the humans for atoms.
Personally, I have a bigger problem with people (like Paul and Carl) who talk about AIs keeping people alive without talking about s-risks in the same breath, or who only mention them in a vague, easy-to-miss way, than I have with Eliezer not addressing Paul’s arguments.
Was my “An important caveat” parenthetical paragraph sufficient, or do you think I should have made it scarier?
Should have made it much scarier. “Superhappies” caring about humans “not in the specific way that the humans wanted to be cared for” sounds better or at least no worse than death, whereas I’m concerned about s-risks, i.e., risks of worse-than-death scenarios.
This is a difficult topic (in more ways than one). I’ll try to do a better job of addressing it in a future post.
To clarify, I don’t actually want you to scare people this way, because I don’t know if people can psychologically handle it or if it’s worth the emotional cost. I only bring it up myself to counteract people saying things like “AIs will care a little about humans and therefore keep them alive”, or when discussing technical solutions/ideas, etc.