I’m still confused about what you count as inside versus outside my planning / reasoning. If I say “spend 90% of your time in the area with the highest known EV and 10% of your time measuring areas which have at least a 1% chance of having higher reward than the current highest EV, if they exist,” then isn’t my ignorance about the world already part of my plan / reasoning, such that I don’t need to deviate from that plan to double-check?
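To make the rule I have in mind concrete, here is a minimal sketch in Python. It rests on assumptions that are mine and only for illustration: each area's reward is summarized by a Gaussian belief `(mean, std)`, "90% / 10% of your time" is read as a per-step probability, and the names `prob_exceeds` and `choose_area` are hypothetical.

```python
import math
import random


def prob_exceeds(mean, std, threshold):
    """P(X > threshold) for X ~ Normal(mean, std). Gaussian belief is an assumption."""
    if std == 0:
        return 1.0 if mean > threshold else 0.0
    z = (threshold - mean) / std
    return 0.5 * (1.0 - math.erf(z / math.sqrt(2.0)))


def choose_area(beliefs, rng, explore_frac=0.10, min_prob=0.01):
    """Pick the next area to visit.

    beliefs: dict mapping area name -> (mean, std) of the believed reward.
    Returns (area_name, mode), where mode is "exploit" or "explore".
    """
    # The area with the highest known expected value.
    best = max(beliefs, key=lambda area: beliefs[area][0])
    best_ev = beliefs[best][0]

    # Areas with at least a min_prob chance of beating the current best EV.
    candidates = [
        area
        for area, (mean, std) in beliefs.items()
        if area != best and prob_exceeds(mean, std, best_ev) >= min_prob
    ]

    # Explore only if such areas exist ("if they exist" in the rule above).
    if candidates and rng.random() < explore_frac:
        return rng.choice(candidates), "explore"
    return best, "exploit"


if __name__ == "__main__":
    rng = random.Random(0)
    beliefs = {"A": (10.0, 0.5), "B": (8.0, 2.0), "C": (2.0, 0.1)}
    picks = [choose_area(beliefs, rng) for _ in range(1000)]
    explored = sum(1 for _, mode in picks if mode == "explore")
    print(f"explored {explored} of {len(picks)} steps")
```

The point of the sketch is that the uncertainty lives inside the policy: the `std` of each belief is what decides whether an area qualifies for the 10% measurement budget, so following the plan already includes the double-checking.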