It’s a bit tongue-in-cheek, but technically for an AI to be aligned, it isn’t allowed to create unaligned AIs. Like if your seed AI creates a paperclip maximizer, that’s bad.
So if humanity accidentally creates a paperclip maximizer, it is technically unaligned under this definition.
I disagree with this. I think the most useful definition of alignment is intent alignment. Humans are effectively intent-aligned on the goal of not killing all of humanity. They may still kill all of humanity, but that is not an alignment problem; it is a capabilities problem: humans aren't capable of knowing which AI designs will be safe.
The same holds for intent-aligned AI systems that create unaligned successors.
Oooh gotcha. In that case, we are not remotely good at avoiding the creation of unaligned humans either! ;)
Because we aren’t aligned.