My formulation of minimising difference is something like the following:
Assume A is the answer given by the oracle.
Predict what would happen to the world if the AI were replaced by a program consisting of a billion NOPs followed by something that outputs A. Call this W1.
When assessing different strategies, predict what would happen in the world given each strategy. Call this W2.
Minimise the difference between W1 and W2.
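To make the steps above concrete, here is a rough sketch in Python. Everything in it is an assumption made for illustration: representing a predicted world as a dictionary of numeric features, the `world_distance` metric (a plain sum of absolute differences), the `predict` function, and the toy strategies are all hypothetical. How to actually predict a world or measure the difference between two worlds is left open by the formulation itself.

```python
from typing import Callable, Dict, Iterable

# Hypothetical representation: a predicted world is a set of named numeric features.
World = Dict[str, float]


def world_distance(w1: World, w2: World) -> float:
    """Assumed difference measure: sum of absolute feature differences."""
    keys = set(w1) | set(w2)
    return sum(abs(w1.get(k, 0.0) - w2.get(k, 0.0)) for k in keys)


def least_difference_strategy(
    strategies: Iterable[str],
    predict: Callable[[str], World],
    w1: World,
) -> str:
    """Pick the strategy whose predicted world W2 is closest to the baseline W1."""
    return min(strategies, key=lambda s: world_distance(w1, predict(s)))


# W1: predicted world if the AI were replaced by NOPs followed by "output A".
W1: World = {"answer_given": 1.0, "resources_used": 0.0}

# Toy predictions of W2 for two made-up strategies.
TOY_PREDICTIONS: Dict[str, World] = {
    "answer_only": {"answer_given": 1.0, "resources_used": 0.1},
    "answer_and_grab_resources": {"answer_given": 1.0, "resources_used": 5.0},
}


def toy_predict(strategy: str) -> World:
    """Stand-in for the AI's world model; simply looks up the toy table."""
    return TOY_PREDICTIONS[strategy]


best = least_difference_strategy(TOY_PREDICTIONS, toy_predict, W1)
print(best)
```

In this toy setup the strategy that only gives the answer wins, because its predicted world differs least from W1. The hard parts, namely the world model and the choice of difference measure, are exactly what the sketch stubs out.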
Is this a more succinct formulation or is it missing something?