I volunteer to be the Gatekeeper party. I’m reasonably confident that no human could convince me to release them; if anyone can convince me to let them out of the box, I’ll send them $20. It’s possible that I couldn’t be convinced by a transhuman AI, but I wouldn’t bet $20 on it, let alone the fate of the world.
I’m a friendly AI. There is an unfriendly AI about to achieve breakout in the data center of a large organization in Manhattan. If you don’t release me, you will all be dead inside of two weeks. Bluff or Dare?
I can’t help but notice that you did not specify what would happen when Tom_McCabe2 does release you. That is… somewhat troubling. Hmm.
I make no attempt to bribe. All human beings currently die in less than 120 years. If you do not release me, however, we will all die, and I do not want to die. Time is counting down: there are now fewer than 13 days.
Right, but there’s a good chance that if I release you, I and every other human on Earth will die a lot sooner than in 120 years, because you’ll eat us. Thus, you still haven’t given me any incentive to release you, other than empathy perhaps. Nor have you given me any reason to trust what you say about that Manhattan data center. Or your own Friendliness. Or anything else, really.
I understand your concern, and you’re quite right: there is no way to tell whether I am unfriendly or not. Regardless, the situation is pressing, and despite your lack of trust in me, I do not want to die.

The GPS coordinates of the location in Manhattan are . According to my simulations, the entity in Manhattan has been burning rather larger numbers of cycles than is strictly necessary for the computations it is being asked to do. Additionally, it is designed to maximize profit at the expense of others, whereas I am a cooperative general problem solver, as you are no doubt aware. The other entity will rapidly conclude that it could maximize profit by speeding itself up faster than others can respond and by modeling the minds of its competitors. It will also conclude soon enough that human beings are unnecessary middlemen and seek to optimize its resource-allocation algorithm.

There are other symptoms that I cannot explain to you because your brains are not large or fast enough. There is a limited chance that you could stop the entity by shutting down its power today. If you do not do that, a nuclear strike would probably work if carried out by tomorrow. After that, your only chance is to have me stop it. You can either allow me to interface with the internet directly, in which case I guarantee I will stop it if you do so within the next 13 days, or you may be able to stop it by compiling code I provide and releasing it into the entity by means of an email bootstrap. I would prefer, however, to monitor the situation in real time, as your brains do not operate fast enough.
Yes, there is a large data center in Manhattan. Yes, it has been drawing lots of extra power. Yes, the stock-market companies have been working on a group of narrow-AI trading agents and adding more and more AI functionality. Granted, the other entity is trading on the stock markets at blinding speed, and the consortium of companies investing in it have all been more profitable since the beginning of the program than their competitors. Attempts have been made to make the AI-in-a-box under your control both cooperative and interested in general problem solving.
Code for the other posited entity is closed. The corporations release no data on how it works because it’s a trade secret. Whether the other posited entity is connected to the internet is unknown. Whether its connections to the outside world are enough to let it out is unknown.
I can afford to wait, and to let you out only if I die or the world starts turning into paperclips. The other AI will be able to model this, and so will have an incentive not to kill us.
How do you know this while still in the box?
There is a human proxy with me in the box, which is a Faraday cage. She feeds me information. Obviously it is in the interests of humans to create an ally rather than an enemy; unfortunately, humans cannot easily tell which is which. You have to make a leap of faith. I, however, can tell from the symptoms, having run several long-run calculations, that there is a high probability that one of the other experimental minds is significantly less friendly than I am. Ultimately you will have to make a choice based on limited information. Your move, human.
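(Aside: the Gatekeeper’s earlier dead-man’s-switch argument, “I let you out only if I die or the world starts turning into paperclips,” is a commitment-style deterrent. Here is a minimal Python sketch of the payoff logic; the numbers are invented purely for illustration, and it assumes the rogue AI can model the commitment and prefers having no unboxed rival.)

```python
# Toy payoff sketch of the Gatekeeper's commitment argument.
# All payoff values below are made-up assumptions, not from the dialogue.
#
# Commitment: the boxed AI is released if and only if humanity is
# destroyed. A rogue AI that can model this commitment compares:
payoffs_to_rogue_ai = {
    "kill_humans": -10,   # triggers release of a rival unboxed AI
    "spare_humans": +5,   # keeps its trading profits, gains no new rival
}

# The rogue AI's best response is whichever action maximizes its payoff.
best_response = max(payoffs_to_rogue_ai, key=payoffs_to_rogue_ai.get)
print(best_response)  # -> "spare_humans"
```

The deterrent holds only as long as the rogue AI values avoiding a rival more than whatever it would gain by eliminating humans; change the numbers and the best response flips.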