jessicata comments on Corrigibility thoughts II: the robot operator

jessicata 20 Jan 2017 2:52 UTC
0 points
AF
What do you find most unsatisfactory about this proposal for having the AI be motivated to maintain the shutdown circuitry? Here the AI does not benefit from influencing the human. I get that there are problems with this proposal, I’m just not sure which one you’re trying to talk about / solve in this post.
- Stuart_Armstrong 24 Jan 2017 12:12 UTC
  0 points
  AF Parent
  In that proposal? The AI is motivated to kill the human to prevent any possible tampering with the shutdown circuitry. If we’ve defined the setup so that someone needs to actively press a button at some point, then killing the human and getting an automated button presser will work.
  
  Protect the circuity doesn’t mean protect the human component of it, unless the human component is defined.
  - jessicata 25 Jan 2017 9:37 UTC
    0 points
    AF Parent
    Makes sense, thanks for clarifying.