“You can specify whatever standard of verifiability you want” is vague. You can say “I want to be absolutely right about whether it’s Friendly”, but you can’t have that unless you know what Friendly means and are smart enough to specify a standard for checking it.
If you could be sure you had a cooperative AGI which could just give you an FAI, I think you’d have basically solved the problem of creating an FAI. But that’s the very problem you’re trying to get the AGI to solve for you.