Meta just unveiled an AI that can play Diplomacy at a human level.
Here we have an AI that, through deception, can persuade humans to ally with it against other humans and set them up for eventual betrayal. This is something Eliezer anticipated with his AI-box experiment.
The AI-box experiment and this result are barely related at all—the only connection between them is that they both involve deception in some manner. “There will exist future AI systems which sometimes behave deceptively” can hardly be considered a meaningful advance prediction.