tailcalled comments on The Hardcore AI Box Experiment

tailcalled 30 Mar 2015 22:04 UTC
2 points
I think the fundamental point I’m trying to make is that Eliezer merely demonstrated that humans are too insecure to box an AI and that this problem can be solved by not giving the AI a chance to hack the humans.
- artemium 31 Mar 2015 6:06 UTC
  0 points
  Parent
  Agree.. The AI boxing Is horrible idea for testing AI safety issues. Putting AI in some kind of virtual sandbox where you can watch his behavior is much better option, as long as you can make sure that AGI won’t be able to become aware that he is boxed in.
  - Vaniver 31 Mar 2015 13:22 UTC
    1 point
    Parent
    Agree.. The AI boxing Is horrible idea for testing AI safety issues. Putting AI in some kind of virtual sandbox where you can watch his behavior is much better option, as long as you can make sure that AGI won’t be able to become aware that he is boxed in.
    
    What’s the difference between the AI’s text output channel and you observing the virtual sandbox?
    Is it possible to ensure that the AI won’t realize that it is boxed in?
    Is it possible to ensure that, if the AI does realize that it is boxed in, we will be able to realize that it realizes that?
    
    As I understand it, the main point of the AI Box experiment was not whether or not humans are good gatekeepers, but that people who don’t understand why it would be enticing to let an AI out of the box haven’t fully engaged with the issue. But even how to correctly do a virtual sandbox for an AGI is a hard problem that requires serious attention.
- dxu 30 Mar 2015 23:11 UTC
  0 points
  Parent
  That being said, if you have an AI, only to seal it in a box without interacting with it in any way (which seems the only realistic way to “not [give] the AI a chance to hack the humans”), that’s not much different from not building the AI in the first place.
  - tailcalled 30 Mar 2015 23:31 UTC
    0 points
    Parent
    I’ll post a list of methods soon, probably tomorrow.