Thanks for the response. I hope my post didn’t read as defeatist; my point isn’t that we shouldn’t try to make AI safe, but that if we pick an impossible strategy, then no matter how hard we try, it won’t work out for us.
So, what’s the reasoning behind your confidence in the statement ‘if we give a superintelligent system the right terminal values, it will be possible to make it safe’? Why do you believe this strategy should be possible in principle, so long as we put enough thought and effort into it?

Which part of my reasoning, as I’ve formulated it, do you find unconvincing? The idea that we can’t keep the AI in the box if it wants to get out, the idea that an AI with terminal values will necessarily end up as an incidentally genocidal paperclip maximizer, or something else entirely that I’m not considering?