The agent doesn’t use enough steps to simulate the predictor. It decides early (because it finds, early on, a proof that the predictor conditionally one-boxes), and deciding early is also what might allow the predictor to conditionally predict the agent’s one-boxing within the predictor’s limited computational resources. The M steps during which the agent protects the predictor from being unconditionally predictable are a very small number here, compared to the agent’s potential capability.