For sure. But fully autonomous agents are a goal a lot of people will surely be working towards, no? I don’t think anyone is claiming “every AI project is dangerous”. They are claiming something more like “AI with the ability to do pretty much all the things human minds do is dangerous”, with the background presumption that as AI advances it becomes more and more likely that someone will produce an AI with all those abilities.
I can PROGRAM an agent so that it never walks out of a box. It never wants to.
Again: for sure, but that isn’t the point at issue.
One exciting but potentially scary scenario involving AI is this: we make AI systems that are better than us at making AI systems, let them get to work designing their successors, let them design their successors, etc. End result (hopefully): a dramatically better AI than we could hope to make on our own. Another, closely related: we make AI systems that have the ability to reconfigure themselves by improving their own software and maybe even adjusting their hardware.
In any of these cases, you may be confident that the AI you initially built doesn’t want to get out of whatever box you put it in. But how sure are you that after 20 iterations of self-modification, or of replacing an AI by the successor it designed, you still have something that doesn’t want to get out of the box?
There are ways to avoid having to worry about that. We can just make AI systems that neither self-modify nor design new AI systems, for instance. But if we are ever to make AIs smarter than us, the temptation to use that smartness to make better AIs will be very strong, and it only requires one team to try it to expose us to any risks that might ensue.
(One further observation: telling people they’re stupid and you’re laughing at them is not usually effective in making them take your arguments more seriously. To some observers it may suggest that you are aware there’s a weakness in your own arguments. (“Argument weak; shout louder.”))
For sure. But fully autonomous agents are a goal a lot of people will surely be working towards, no? I don’t think anyone is claiming “every AI project is dangerous”. They are claiming something more like “AI with the ability to do pretty much all the things human minds do is dangerous”, with the background presumption that as AI advances it becomes more and more likely that someone will produce an AI with all those abilities.
Again: for sure, but that isn’t the point at issue.
One exciting but potentially scary scenario involving AI is this: we make AI systems that are better than us at making AI systems, let them get to work designing their successors, let them design their successors, etc. End result (hopefully): a dramatically better AI than we could hope to make on our own. Another, closely related: we make AI systems that have the ability to reconfigure themselves by improving their own software and maybe even adjusting their hardware.
In any of these cases, you may be confident that the AI you initially built doesn’t want to get out of whatever box you put it in. But how sure are you that after 20 iterations of self-modification, or of replacing an AI by the successor it designed, you still have something that doesn’t want to get out of the box?
There are ways to avoid having to worry about that. We can just make AI systems that neither self-modify nor design new AI systems, for instance. But if we are ever to make AIs smarter than us, the temptation to use that smartness to make better AIs will be very strong, and it only requires one team to try it to expose us to any risks that might ensue.
(One further observation: telling people they’re stupid and you’re laughing at them is not usually effective in making them take your arguments more seriously. To some observers it may suggest that you are aware there’s a weakness in your own arguments. (“Argument weak; shout louder.”))