To answer the first question, it’s because no one will actually create/train a paperclip maximiser. The scenario holds water if such an AI is created, but none will be.
People scrutinising that hypothetical and rightly dismissing it may overupdate towards AI risk not being a serious concern. It’s a problem if the canonical thought experiment of AI misalignment is not very realistic and therefore easily dismissed.
It probably stuck around because of ~founder effects.
Do you mean that no one will actually create a paperclip maximizer specifically, or no agent of that kind at all? I.e. with goals such as “collect stamps” or “generate images”? Because I think Eliezer meant to object to that class of examples, rather than only that specific one, but I’m not sure.
We probably wouldn’t uncritically let loose an AI whose objective was to maximise the quantity of some physical stuff (paperclips, stamps, etc.). If we commit a (very) stupid outer alignment failure, we’re more likely to train an AI to maximise “happiness” or something similar.
I agree with you here, although something like “predict the next token” seems more and more likely. That said, I’m not sure whether it belongs to the same class of goals as paperclip maximizing in this context, or whether the kind of failure it could lead to would be similar.