According to figure 6b in “Mastering the Game of Go without Human Knowledge”, the raw policy network has 3055 elo, which according to this other page (I have not checked that these Elos are comparable) makes it the 465th best player. (I don’t know much about this and so might be getting the inferences wrong, hopefully the facts are useful)
I don’t think those ratings are comparable. On the other hand, my estimate of 3d was apparently lowballing it based on some older policy networks, and newer ones are perhaps as strong as 4d to 6d, which on the upper end is still weaker than professional players but not by much.
However, there is a big gap between weak professional players and “grandmaster level”, and I don’t think the raw policy network of AlphaGo could play competitively against a grandmaster level Go player.
According to figure 6b in “Mastering the Game of Go without Human Knowledge”, the raw policy network has 3055 elo, which according to this other page (I have not checked that these Elos are comparable) makes it the 465th best player. (I don’t know much about this and so might be getting the inferences wrong, hopefully the facts are useful)
I don’t think those ratings are comparable. On the other hand, my estimate of 3d was apparently lowballing it based on some older policy networks, and newer ones are perhaps as strong as 4d to 6d, which on the upper end is still weaker than professional players but not by much.
However, there is a big gap between weak professional players and “grandmaster level”, and I don’t think the raw policy network of AlphaGo could play competitively against a grandmaster level Go player.