Hm. I expect that within the set of environments where exploitation can alter the results of what-to-exploit-next calculations, there more possible ways for it to do so such that the right move in the next iteration is further exploration than further exploitation.
So, yeah, I’ll accept “will get you better results on average.”
OK, understood. Thanks for clarifying.
Hm. I expect that within the set of environments where exploitation can alter the results of what-to-exploit-next calculations, there more possible ways for it to do so such that the right move in the next iteration is further exploration than further exploitation.
So, yeah, I’ll accept “will get you better results on average.”