Charlie Steiner comments on AlphaGo Zero and capability amplification

Charlie Steiner 10 Jan 2019 8:15 UTC
1 point
0
This is true when getting training data, but I think it’s a difference between A (or HCH) and AlphaGo Zero when doing simulation / amplification. Someone wins a simulated game of Go even if both players are making bad moves (or even random moves), which gives you a signal that A doesn’t have access to.