I can believe that it’s possible to defeat a Go professional by some extremely weird strategy that causes them to have a seizure or something in that spirit. But, is there a way to do this that another human can learn to use fairly easily? This stretches credulity somewhat.
While I can’t say anything about Go, but con artists, social engineers, shoplifters, hypnotisers, cult recruiters etc, know many tricks designed to nudge you towards predictable mistakes.
What’s interesting to me, it’s that our first useful for pivotal act superintelligent AGI probably will have multiple such holes and it can be useful for a third-line-defence corrigibility.
While I can’t say anything about Go, but con artists, social engineers, shoplifters, hypnotisers, cult recruiters etc, know many tricks designed to nudge you towards predictable mistakes.
What’s interesting to me, it’s that our first useful for pivotal act superintelligent AGI probably will have multiple such holes and it can be useful for a third-line-defence corrigibility.