Do you run into a distinction between benign and malign tampering at any point? For example, if humans can never tell the difference between the tampered and non-tampered result, and their own sanity has not been compromised, it is not obvious to me that the tampered result is worse than the non-tampered result.
It might be easier to avoid compromising human sanity + use hold-out sensors than to solve ELK in general (though maybe not? I haven’t thought about it much).
In some sense this is exactly what we want to do, and this is why we are happy with a very “narrow” version of ELK (see the appendices on narrow elicitation and why it might be enough, indirect normativity, and avoiding subtle manipulation).
But you still need to care about some sensor tampering. In particular, you need to make sure that there are actually happy humans deliberating about what to do (under local conditions that they believe are conducive to figuring out the answer), rather than merely cameras showing happy humans deliberating about what to do.
So, granting the assumption of not corrupting the humans (which is maybe what you are denying), doesn’t this imply that we can go on adding sensors after the fact until, at some point, the difference between fooling them all and being honest becomes unproblematic?