Eliezer Yudkowsky comments on Dreams of Friendliness

Eliezer Yudkowsky 3 Sep 2008 21:40 UTC
1 point
Peter, the best possible version of an Oracle AI is a Friendly Oracle AI where you didn’t skip any of the hard problems: where you guaranteed its self-improvement and taught it what “should” means, where the AI is checking the distant effects of its own answers and can refuse to answer. Then the question is, if you can do these things, do you still get a substantial safety improvement out of making it a Friendly Oracle AI rather than a Friendly AI? That’s the question I look at once a year.
- TheAncientGeek 18 Sep 2015 12:34 UTC
  −2 points
  If that type of full-strength AI is close in algorithmspace to a dangerously unfriendly AI...and you have pretty much argued that it is...then that is not safe, because you cannot rely on complex projects being got right 100% of the time.