Hastings comments on Instrumental Convergence Bounty

Hastings 15 Sep 2023 0:41 UTC
35 points
3
Hi! I might have something close. The chess engine Stockfish has a heuristic for what it wants, with manually specified values for how much it wants to keep its bishops, or open up lines of attack, or connect its rooks, etc. I tried to modify this function to make it want to advance the king up the board, by adding a direct reward for every step forward the king takes. At low search depths, this leads it to immediately move the king forward, but at high search depths it mostly just attacks the other player in order to make it safe to move the king, ( best defense is offense) and only starts moving the king late in the game. I wasn’t trying to demonstrate instrumental convergence, in fact this behavior was quite annoying as it was ruining my intented goal (creating fake games demonstrating the superiority of the bongcloud opening)

modified stockfish: https://github.com/HastingsGreer/stockfish
This was 8 years ago, so I’m fuzzy on the details. If it sounds like vaguely what you’re looking for, reply to let me know and I’ll write this up with some example games and make sure the code still runs.
What links here?
- Instrumental Convergence Bounty by Logan Zoellner (14 Sep 2023 14:02 UTC; 62 points)
- Logan Zoellner 15 Sep 2023 12:42 UTC
  5 points
  0
  Parent
  This sounds like exactly the type of thing I’m looking for.
  
  Do you know, in the “convergent” case, is the score just for advancing the king, or is there a possibility that there’s some boring math explanation like “as the number of steps calculated increases, the weights on the other positions (bishops, lines of attack, etc) overwhelms the score for the king advancing”?
  - Hastings 15 Sep 2023 18:31 UTC
    4 points
    1
    Parent
    I think I fully lobotomized the evaluation function to only care about advancing the king, except that it still evaluates checkmate as +infinty. Here’s a sample game:
    https://www.chesspastebin.com/view/24278
    
    It doesn’t really understand material anymore except for the queen, which I guess is powerful enough that it wants to preserve it to allow continued king pushing. I still managed to lose because I’m not very good at chess.
    
    EDIT: the installation guide I listed below was too many steps and involved blindly trusting internet code, that’s silly. Instead just threw it up on lichess and you can play it in the browser here: https://lichess.org/@/king-forward-bot
    
    ~~If you want to play yourself,~~ you can compile the engine with
    
    git clone https://github.com/HastingsGreer/stockfish
    cd stockfish/src
    make
    
    and then can install a gui like xboard (on mac, brew install xboard) and add your stockfish binary as an UCI engine.
    - Logan Zoellner 15 Sep 2023 22:01 UTC
      3 points
      0
      Parent
      This looks great!
      I will DM you to figure out how to send the bounty.
    - gwern 15 Sep 2023 20:10 UTC
      3 points
      0
      Parent
      It would be more elegant to remove the checkmate penalty. After all, checkmate is instrumentally bad, because on the next move, the king is captured and can no longer move forward any spaces—unless he’s advanced to the final row* and so has maximized his movement score, and so why care about what happens after that? Nothing will increase (or decrease) his movement reward.
      
      * I take by ‘advancing the king’ you mean to ‘moving to a new unvisited’ row, so it maxes at 8, for the 8 rows of the chess board, and not simply moving at all, since that would have degenerate solutions like building a fortress and then the king simply moving back and forth between two spaces for eternity.
      - Hastings 15 Sep 2023 20:18 UTC
        1 point
        0
        Parent
        I agree! The stockfish codebase handles evaluation of checkmates somewhere else in the code, so that would be a bit more work, but it’s definitely the correct next step.
- Insub 15 Sep 2023 21:40 UTC
  3 points
  0
  Parent
  That’s great. “The king can’t fetch the coffee if he’s dead”