I think I basically understand your objections, at least in outline. I think there is a significant epistemic probability that they are wrong, but even if they are correct, I don’t think they at all rule out the possibility that a boxed unfriendly AI can give you a really friendly AI. My most recent post takes the first steps toward doing this in a way that you might believe.