Perhaps the difference here is one of risk sensitivity—similarly to the way a gambler going strictly for long term gains over the largest number of iterations will use the Kelly Criterion, Michael Vassar optimizes for becoming the least wrong when scores are tallied up at the end of the game. Wei Dai would prefer to minimize the volatility of his wrongness instead, taking smaller but steadier gains in correctness.
Perhaps the difference here is one of risk sensitivity—similarly to the way a gambler going strictly for long term gains over the largest number of iterations will use the Kelly Criterion, Michael Vassar optimizes for becoming the least wrong when scores are tallied up at the end of the game. Wei Dai would prefer to minimize the volatility of his wrongness instead, taking smaller but steadier gains in correctness.