GPT-3.5 can play chess at the 1800 Elo level, which is terrifying and impossible without at least a chess world model.
I used to think that it would be very difficult for an LLM to build a model of chess because chess is not about words and sentences. But a discussion led to the realization that the chess model underlying chess notation is not that different from long-distance referents in (programming) languages. Imagine the 2D chess grid not as a physical board but as a doubly nested array with fixed length (the fixed length might make it even easier). GPT clearly can do that. And once it has learned that, all higher layers can focus on the rules (though without the MCTS of AlphaZero).
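A minimal sketch of that idea (names and helpers here are my own illustration, not anything GPT actually uses): the grid as a fixed-length doubly nested array, with an algebraic square like "e4" resolved to fixed indices, much like resolving a long-distance referent to a known location.

```python
FILES = "abcdefgh"

def empty_board():
    # 8x8 fixed-length nested array; rank 1 at index 0, "." marks an empty square
    return [["." for _ in range(8)] for _ in range(8)]

def square_to_index(square):
    # "e4" -> (rank_index, file_index) = (3, 4)
    file_idx = FILES.index(square[0])
    rank_idx = int(square[1]) - 1
    return rank_idx, file_idx

def apply_move(board, src, dst):
    # Move whatever sits on src to dst -- deliberately no legality check,
    # which is exactly where a shaky world model would show up
    r1, f1 = square_to_index(src)
    r2, f2 = square_to_index(dst)
    board[r2][f2], board[r1][f1] = board[r1][f1], "."

board = empty_board()
r, f = square_to_index("e2")
board[r][f] = "P"            # white pawn on e2
apply_move(board, "e2", "e4")
print(board[3][4])           # -> P: the pawn now sits on e4
```

Tracking board state this way is pure bookkeeping; the hard part (knowing which moves are legal from a given state) lives in the layers above, which is consistent with the illegal-moves observation below.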
Note that it also makes illegal moves from rare board states, which means its model of chess is pretty questionable.
I made an illegal move while playing over the board (5+3 blitz) yesterday and lost the game. Maybe my model of chess (even when seeing the current board state) is indeed questionable, but well, it apparently happens to grandmasters in blitz too.
I would highly recommend playing against it and trying to get it confused and out of distribution; it's very difficult, at least for me.