If you only look at the loss of the worst experiment (so the maximum CaSc loss rather than the average one), you don’t get this kind of cancellation problem.
I think this “max loss” procedure is different from what Buck wrote and the same as what I wrote.
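To make the cancellation point concrete, here is a toy sketch. The per-experiment losses and the baseline are invented for illustration, and `summarize` is a hypothetical helper, not anything from an actual causal scrubbing implementation. It assumes "cancellation" means that some scrubbed experiments land below the baseline loss and others above it, so the average hides the discrepancy.

```python
def summarize(scrubbed_losses, baseline_loss):
    """Return (mean gap, max gap) between scrubbed and baseline loss.

    scrubbed_losses: one loss per scrubbing experiment (made-up numbers here).
    baseline_loss: loss of the unscrubbed model (also made up).
    """
    mean_gap = sum(scrubbed_losses) / len(scrubbed_losses) - baseline_loss
    max_gap = max(scrubbed_losses) - baseline_loss
    return mean_gap, max_gap


# Hypothetical losses: two experiments do better than baseline, two do worse.
baseline_loss = 1.0
scrubbed_losses = [0.5, 1.5, 0.75, 1.25]

mean_gap, max_gap = summarize(scrubbed_losses, baseline_loss)
print("average-loss gap:", mean_gap)  # the highs and lows cancel out
print("max-loss gap:", max_gap)       # the worst experiment is still visible
```

With these numbers the average gap is zero even though no single experiment matches the baseline, while the max gap still shows the worst experiment.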