Do you think that solving StarCraft (by self-play) will require some major insight, or will it just be a matter of incremental improvement of existing methods?
I don’t think it will require any new insight. It might require slightly different algorithms: better techniques for scaling, different architectures to handle incomplete information, and maybe a different training strategy to handle the very long time horizons. If they don’t tie their hands, it’s probably also worth adding a bunch of domain-specific junk.