I think human go players consider points ahead as part of the situation and still don’t just play a move that provides no benefit but costs points in the endgame.
We are not talking about situation where there’s any benefit to be gained from the behavior as that behavior happened in situations that can be fully read out.
There are situations in Go where you don’t start a fight that you expect to win with 95% because you already ahead on the board and the 5% might make you lose but that’s very far from the moves of AlphaGo that I was talking about.
AlphaGo plays moves that are bad according to any pattern of what’s good in Go when it’s ahead.
I think human go players consider points ahead as part of the situation and still don’t just play a move that provides no benefit but costs points in the endgame.
We are not talking about situation where there’s any benefit to be gained from the behavior as that behavior happened in situations that can be fully read out.
There are situations in Go where you don’t start a fight that you expect to win with 95% because you already ahead on the board and the 5% might make you lose but that’s very far from the moves of AlphaGo that I was talking about.
AlphaGo plays moves that are bad according to any pattern of what’s good in Go when it’s ahead.
I feel like it’s pretty relevant that AlphaGo is the worst super-human go bot, and I don’t think better bots have this behaviour.
Last I heard, Leela Zero still tended to play slack moves in highly unbalanced late-game situations.