What do you find most unsatisfactory about this proposal for having the AI be motivated to maintain the shutdown circuitry? Here the AI does not benefit from influencing the human. I get that there are problems with this proposal, I’m just not sure which one you’re trying to talk about / solve in this post.
In that proposal? The AI is motivated to kill the human to prevent any possible tampering with the shutdown circuitry. If we’ve defined the setup so that someone needs to actively press a button at some point, then killing the human and getting an automated button presser will work.
Protect the circuity doesn’t mean protect the human component of it, unless the human component is defined.
What do you find most unsatisfactory about this proposal for having the AI be motivated to maintain the shutdown circuitry? Here the AI does not benefit from influencing the human. I get that there are problems with this proposal, I’m just not sure which one you’re trying to talk about / solve in this post.
In that proposal? The AI is motivated to kill the human to prevent any possible tampering with the shutdown circuitry. If we’ve defined the setup so that someone needs to actively press a button at some point, then killing the human and getting an automated button presser will work.
Protect the circuity doesn’t mean protect the human component of it, unless the human component is defined.
Makes sense, thanks for clarifying.