Thanks for the link. Their algorithm, the “multiplicative update rule,” which works by “selecting each arm randomly with probabilities that evolve based on their past performance,” does not seem to me to be the same strategy as the one Eliezer describes. So does this contradict his argument?
Yes.