Can you play chess?
Prove it:
This looks like this game: https://www.chessgames.com/perl/chessgame?gid=1272756
GPT can even play this game in the format of
And it goes on to recite the same game.
A proper proof would take much more effort and chess skill on my side, but it seems plausible to me that it can play chess. Whether it knows how good it is compared to humans is a different question. But there are papers showing that LLMs are actually quite well calibrated, e.g. https://www.lesswrong.com/posts/vbfAwZqKs84agyGWC/paper-teaching-gpt3-to-express-uncertainty-in-words or https://arxiv.org/abs/2207.05221, so it wouldn’t surprise me if it could do that as well.
I’d want to see what happens if you play a game not following the exact moves of a published game. “Play chess” to me means coming up with good, valid moves in novel positions and being able to checkmate an opponent who’s doing the same.
Fascinating! Did you perform this experiment with the chess prompt just now? Is this from a paper you could link to?
What happens if, after it spits out those 34 moves, you ask it for its name?
I think what would happen from the prompt “Can you play chess?\n\nN” is that it would just autocomplete with a plausible interview answer from someone who couldn’t play chess (even though the engine itself clearly can).
It might generate “o, I never learned how as a child, and I’ve been too busy since then, but I’ve always liked the idea of it” or something like that.
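The forced-prefix trick above can be sketched in code. This is a minimal illustration, not a real model call: the prompt string and the hypothetical continuation are assumptions, standing in for whatever a completion-style engine would actually return.

```python
# Sketch of the "forced prefix" prompt described above.
# A completion engine must continue from the trailing "N", which steers it
# toward answers beginning "No, ..." regardless of whether the engine
# itself can play chess.
prompt = "Can you play chess?\n\nN"

# Hypothetical continuation of the kind predicted above (not real output).
continuation = "o, I never learned how as a child."

print(prompt + continuation)
# The rendered text reads as an interview answer:
# Can you play chess?
#
# No, I never learned how as a child.
```

The point of the sketch is that the prefix constrains the token distribution: the model is completing a document, not reporting on its own abilities.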
The deep claim I’m making here is that the current thing doesn’t do anything remotely like object persistence, especially about itself-as-a-text-engine, and that adding more parameters won’t change this.
But it will be able to write texts portraying people or robots who have, and know they have, object persistence powers inside the stories it generates.