Surely everyone would agree that the first task we want our AI to solve is FAI itself (even if we are “100%” sure that our plan has no leaks, we would still want the AI to check it while we are still able to shut it down). It’s easy to imagine one AI lying about its own safety, but many AIs lying about their safety (including the safety of other AIs!) is much harder to imagine (certainly still possible, but also less probable). Only when we are incredibly sure of our FAI solution can we ask the AI to solve other questions for us. Also, those AIs would constantly try to find bad consequences of our main_AI’s proposals (because they also don’t want to risk their lives, and also because we ask them to give us this information). And certainly we don’t give the AI access to the internet, we take precautions regarding the people interacting with it, etc. etc. (all of which is well described in other places).
Certainly, this overall solution still has its drawbacks (I think every solution will) and we have to improve it in many ways. In my opinion, it would be good if we didn’t launch AI during the next 1000 years :-) but the problem is that terrorist organizations and mad people would be able to launch it despite our intentions… so we have to launch AI more or less soon anyway (or get rid of all terrorists and mad clever people, which is nearly impossible). So we have to formulate a combination of tricks that is as safe as we can get. I find it counter-productive to throw away everything that is not “100%” safe while searching for some magic “100%” super-solution.