The argument centers on the fallibility of some (naive) formal proofs of Friendliness that I've seen people discussing the AI box problem be willing to accept.
This post might make more sense to me if you presented these proofs.