with either a single action (e.g., “move right”) or multiple actions until hard drop
Is it possible to move partially down before moving sideways? If so, and if the models are playing badly, then doing that is usually a bad move, since it gives the piece a chance to land on a ledge higher up. If the multiple-action variant encourages hard drops, it should perform better.
The multiple-action variant also tells the model where the current piece is, which it can’t reliably work out from the board image.
The optimal strategy, given the model’s effective blindness, is to build three towers: left, right, and center. The model might even suggest that strategy itself if you ask it to brainstorm.
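To make the three-tower idea concrete, here is a rough sketch of a policy that never looks at the board image. It assumes the harness reports the piece's current column and the stack height of each column, and that the action strings are "move left", "move right" and "hard drop"; the function name, the target columns and the 10-wide board are all made up for illustration, not taken from the actual setup.

```python
def plan_actions(piece_x, column_heights, targets=(0, 4, 8)):
    """Build one multi-action turn: shift sideways, then hard drop.

    piece_x        -- current column of the piece (assumed to be reported by the harness)
    column_heights -- stack height per column (hypothetical input)
    targets        -- the left, center and right tower columns (hypothetical choice)
    """
    # Pick the tower with the shortest stack; ties go to the first entry in `targets`.
    target = min(targets, key=lambda col: column_heights[col])
    shift = target - piece_x
    step = "move right" if shift > 0 else "move left"
    # All sideways moves happen at the top, so the piece never slides onto a ledge partway down.
    return [step] * abs(shift) + ["hard drop"]


if __name__ == "__main__":
    heights = [3, 0, 0, 0, 5, 0, 0, 0, 1, 0]
    print(plan_actions(4, heights))
```

Every turn ends in a hard drop, and the only state it needs is the piece position plus three heights, which is about as much as a model that can't see the board can be trusted with.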