Agreed, but that was the point I was trying to make. If you take away the AI’s ability to lie, it gains the advantage that you believe what it says: it becomes credible. That is especially dangerous when the AI can make credible threats (which potentially include threats to create simulations, but simpler things work too) and also credible promises, if only you’d be so kind as to [whatever helps the AI get what it wants].