Do you mean that no one will actually create a literal paperclip maximizer, or that no one will create any agent of that kind, i.e. with goals such as "collect stamps" or "generate images"? I think Eliezer meant to object to that whole class of examples rather than only that specific one, but I'm not sure.
We probably wouldn't uncritically let loose an AI whose goal was to maximise the quantity of some physical stuff (paperclips, stamps, etc.). If we make a (very) stupid outer alignment failure, we're more likely to have trained an AI to maximise "happiness" or something similar.
I agree with you here, although something like "predict the next token" seems more and more likely. That said, I'm not sure whether that falls into the same class of goals as paperclip maximizing in this context, or whether the kind of failure it could lead to would be similar.