jsteinhardt comments on Beyond Bayesians and Frequentists

jsteinhardt 31 Oct 2012 19:10 UTC
0 points
Suppose that there are two possible policies A and B, and in the worst case A gives utility 1 and B gives utility 2, but for the specific problem we care about we require a utility of 3. Then an algorithm that optimizes for the worst case will choose B. On the other hand, there is no algorithm (that only chooses between policies A and B) that can guarantee a utility of 3. If you absolutely need a utility of 3 then you’d better come up with a new policy C, or find an additional valid assumption that you can make. The subtlety here is that “optimizing for the worst case” implicitly means “with respect to the current set of assumptions I have encoded into my algorithm, which is probably a subset of the full set of assumptions that I as a human make about the world”.

The notion of guaranteed performance is important because it tells you when you need to do more work and design a better algorithm (for instance, by finding additional regularities of the environment that you can exploit).