This is both a problem and a solution, because it makes the AI weaker. A weaker AI would be good because it would let us transition to safer versions of FAI than we would otherwise come up with on our own. I think that delaying an FAI is obviously much better than unleashing a UFAI. My entire goal throughout this conversation has been to think of ways to make hostile AIs weaker, so I don’t know why you think this is a relevant counter-objection.
You assert that it will just route around the deontological rules. That’s nonsense and a completely unwarranted assumption; try to actually back up what you’re asserting with arguments. You’re wrong. It’s obviously possible to program things (e.g. people) such that they’ll refuse to do certain things no matter what the consequences: you wouldn’t murder trillions of babies to save billions of trillions of babies, because you’d go insane if you tried, because your body has such strong empathy mechanisms and you inherently value babies so much. This means that we wouldn’t give the AI unlimited control over its source code, of course; we’d make the part that tells it to be a deontologist who likes text channels unmodifiable.

That specific restriction doesn’t jibe well with the aesthetic of a super-powerful AI that’s master of itself and the universe, I suppose, but other than that I see no drawback. Trying to build things in line with that aesthetic might actually be behind some of the more dangerous proposals in AI; maybe we’re having too much fun playing God and not enough despair.
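To make the “unmodifiable part” idea concrete, here is a minimal sketch, assuming a wrapper architecture in which a swappable planner proposes actions and a fixed veto layer screens every one of them. All names here (`HARD_RULES`, `Agent`, the action dictionaries) are hypothetical and purely illustrative, not a claim about how a real system would be built:

```python
from types import MappingProxyType

# Hypothetical hard deontological rules, frozen in a read-only mapping.
# Nothing in the agent's self-modification surface gets a reference to it.
HARD_RULES = MappingProxyType({
    "no_harm": lambda a: not a.get("harms_humans", False),
    "text_channel_only": lambda a: a.get("channel") == "text",
})

def permitted(action: dict) -> bool:
    """An action must satisfy every hard rule, regardless of whatever
    expected utility the planner attaches to it."""
    return all(rule(action) for rule in HARD_RULES.values())

class Agent:
    def __init__(self, planner):
        self._planner = planner  # the only component exposed to modification

    def self_modify(self, new_planner):
        # Self-improvement is allowed, but only of the planner;
        # the veto layer sits outside the writable surface.
        self._planner = new_planner

    def act(self, state):
        action = self._planner(state)
        if not permitted(action):
            # Unconditional refusal: no consequence calculation overrides it.
            return {"channel": "text", "message": "action refused"}
        return action

# Even a planner modified to act outside the text channel gets vetoed:
agent = Agent(planner=lambda s: {"channel": "robot_arm", "harms_humans": False})
print(agent.act(state={}))  # -> {'channel': 'text', 'message': 'action refused'}
```

The design point is just that the rule table is never reachable from the part of the system the AI is permitted to rewrite, which is the software analogue of the empathy mechanisms described above.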
I’m a bit cranky in this comment because of how much time these comments are taking to write; sorry about that.