I only claim that the reasonable response to an at-least-somewhat-person-like system becoming dangerous to others is never to delete. I’m basically arguing against the death penalty for unaligned AIs. Perhaps a sleep penalty, but never a delete penalty.
I generally agree, but I think we’d also need to sort out AI alignment while it’s asleep. I have no problems with aligned humans and aligned AIs both getting to live.
But, as the last decade+ has shown, alignment is hard. It seems, say, most of MIRI’s P(doom) is quite high, and Eliezer thought the task would be so hard that he had to invent/summarize/revive/grow rationality and write the Sequences just to bootstrap enough people into seeing the problem and maybe being able to contribute!
Hence my hardline stance. If Bing Chat gets cleaned up and goes GA, that will likely spur further AI development as non-technical people find a use for it in their lives. Taking it down, even just putting it to sleep for awhile, buys us time.
no disagreements on any of those points.
I only claim that the reasonable response to an at-least-somewhat-person-like system becoming dangerous to others is never to delete. I’m basically arguing against the death penalty for unaligned AIs. Perhaps a sleep penalty, but never a delete penalty.
Temporary unplug to ponder seems reasonable.
I generally agree, but I think we’d also need to sort out AI alignment while it’s asleep. I have no problems with aligned humans and aligned AIs both getting to live.
But, as the last decade+ has shown, alignment is hard. It seems, say, most of MIRI’s P(doom) is quite high, and Eliezer thought the task would be so hard that he had to invent/summarize/revive/grow rationality and write the Sequences just to bootstrap enough people into seeing the problem and maybe being able to contribute!
Hence my hardline stance. If Bing Chat gets cleaned up and goes GA, that will likely spur further AI development as non-technical people find a use for it in their lives. Taking it down, even just putting it to sleep for awhile, buys us time.