Not precisely. The advantage here is that we can just ask the AI what results it predicts from the implementation of the “better” AI, and check them against our intuitive ethics.
Now, you could argue that humans might be negligent in applying such safety measures. I think it’s important to think through the risk scenarios in that case.