That sounds like a terrible strategy. Your threat won’t be credible because your goal is to make the world better, not destroy it. And anything you do to make the threat credible (like some sort of precomitment mechanism) will risk the world actually getting destroyed.
It could work as a precautionary measure against existential risk. If someone is planning on doing something that also risks the world getting destroyed, then the threat could be credible.
(I am not endorsing humans actually using this strategy in the real world, obviously).
How would you leverage a button that destroys the world to make the world better?
By blackmailing powerful people into doing good, I assume.
That sounds like a terrible strategy. Your threat won’t be credible because your goal is to make the world better, not destroy it. And anything you do to make the threat credible (like some sort of precomitment mechanism) will risk the world actually getting destroyed.
It could work as a precautionary measure against existential risk. If someone is planning on doing something that also risks the world getting destroyed, then the threat could be credible.
(I am not endorsing humans actually using this strategy in the real world, obviously).
I agree.