A kill switch on a smarter-than-human AGI is reliable iff the AGI wants to be turned off in the cases where we’d want it turned off.
Otherwise you’re just betting that you can see the problem before the AGI can prevent you from hitting the switch (or prevent you from wanting to hit the switch, which amounts to the same thing). I wouldn’t make complicated bets for large stakes against potentially much smarter agents, no matter how much I thought I’d covered my bases.
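(A minimal toy sketch of that “iff,” not part of the original exchange: the outcomes, action names, and utility numbers below are invented purely for illustration. What it makes concrete is that a plain expected-utility maximizer only leaves its switch operable when being shut down is what it actually prefers in that situation; otherwise routing around the switch dominates.)

```python
# Toy illustration of the kill-switch claim above (hypothetical outcomes and
# made-up utility numbers; not anyone's proposed design).
# An expected-utility maximizer leaves the switch operable only if its own
# utility ranks "be shut down when the humans want that" above the alternative.

def best_action(utility):
    """Return the action an expected-utility maximizer would pick.

    `utility` maps outcomes to values. Outcomes:
      - "shut_down":        the humans notice the problem and the switch works
      - "keep_running_bad": the switch (or our wish to press it) has been
                            neutralized, and the AGI keeps pursuing its goal
                            in a situation we'd object to
    """
    actions = {
        # Leaving the switch alone leads to shutdown once we notice the problem.
        "leave_switch_alone": utility["shut_down"],
        # Disabling the switch (or our desire to use it) avoids shutdown.
        "disable_switch": utility["keep_running_bad"],
    }
    return max(actions, key=actions.get)

# An agent that only values its own goal: shutdown looks like pure loss.
goal_only = {"shut_down": 0.0, "keep_running_bad": 10.0}
print(best_action(goal_only))    # -> "disable_switch"

# An agent that genuinely wants to be turned off in exactly those cases.
corrigible = {"shut_down": 10.0, "keep_running_bad": 0.0}
print(best_action(corrigible))   # -> "leave_switch_alone"
```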
Or at least, that it wants to follow our instructions, and can reliably understand what we mean in such simple cases. That does, of course, mean we shouldn’t plan on building an AGI that wants to follow its own agenda and then try to enslave it against its will; that would clearly be foolish. But it doesn’t mean we either can or need to count on starting off with an AGI that understands our requirements in more complex cases.
That’s deceptively simple-sounding.
Of course it’s not going to be simple at all, and that’s part of my point: no amount of armchair thought, no matter how smart the thinkers, is going to produce a solution to this problem until we know a great deal more than we presently do about how to actually build an AGI.