If an artificial intelligence is smart enough to be dangerous, we’d intuitively expect it to be smart enough to know how to make itself safe.
I don’t agree with that. Just look at humans: they are smart enough to be dangerous, but even when they do want to “make themselves safe”, they are usually unable to do so. A lot of harm is done by people with good intentions. I don’t think all of Molière’s doctors prescribing bloodletting were intending to do harm.
Yes, a sufficiently smart AI will know how to make itself safe if it wishes, but the intelligence required for that is much higher than the intelligence required to be harmful.
Agreed. The reason I link the two abilities is that I’m assuming any AI that acquires either of them has gone FOOM, which makes it much more likely that the two abilities will arise at (on a human scale) essentially the same time.
Suppose an FAI has a utility function like “maximise X while remaining Friendly”, while a UFAI just has “maximise X”. If the FAI and the UFAI were initiated simultaneously, I would expect both to develop exponentially, but the UFAI would have more options available and thus a steeper growth curve. So in this situation I’d expect the UFAI to go FOOM slightly sooner, and to be able to disable the FAI.
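To make the “more options” point concrete, here is a minimal toy sketch (my own illustration with made-up growth numbers, not anything from the original discussion): each agent greedily applies the best growth multiplier in its action set, and the Friendliness constraint is modelled, purely hypothetically, as removing the fastest-growth actions. Because the FAI optimises over a subset of the UFAI’s options, its best multiplier can never be larger, so it reaches any capability threshold no sooner.

```python
import random

random.seed(0)

def run(options, threshold=1e6, max_steps=1000):
    """Return the step at which capability first exceeds `threshold`,
    greedily applying the best available growth multiplier each step."""
    capability = 1.0
    best = max(options)  # a greedy optimiser always picks its best action
    for step in range(max_steps):
        capability *= best
        if capability >= threshold:
            return step + 1
    return None

# Hypothetical action sets: the UFAI's options are a strict superset of the
# FAI's, because the FAI must discard actions that violate Friendliness.
all_actions = [1.01 + 0.04 * random.random() for _ in range(20)]
friendly_actions = sorted(all_actions)[:15]  # drop the 5 fastest-growth actions

print("UFAI crosses the threshold at step:", run(all_actions))
print("FAI crosses the threshold at step:", run(friendly_actions))
# Since max(subset) <= max(superset), the FAI's per-step growth can never
# exceed the UFAI's, so the UFAI goes FOOM no later, and typically sooner.
```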