If you aren’t forced to learn all the ways of doing the task, then you should expect the neural net to learn only one of the ways. So maybe it’s that the adversarial nature of OpenAI Five forced it to learn all the ways, and it was then paradoxically easier to remember all of the ways than just one of the ways.
Intuitively, if you forget how to do something one way, but you remember how to do it other ways, then that could make figuring out the other way again easier, thought I don’t have a reason to suspect that would be the case for NNs/etc, and might depend on the specifics of the task.
Unclear, seems like it could go either way.
If you aren’t forced to learn all the ways of doing the task, then you should expect the neural net to learn only one of the ways. So maybe it’s that the adversarial nature of OpenAI Five forced it to learn all the ways, and it was then paradoxically easier to remember all of the ways than just one of the ways.
Intuitively, if you forget how to do something one way, but you remember how to do it other ways, then that could make figuring out the other way again easier, thought I don’t have a reason to suspect that would be the case for NNs/etc, and might depend on the specifics of the task.