The survey doesn’t seem to define what ‘human novice’ performance is. But EfficientZero’s performance curve looks pretty linear in Figure 7 over the 220k frames, finishing at ~1.9x human gametester performance after 2h (6x the allotted 20min). So, extrapolating linearly back from that endpoint, EfficientZero at 20min is presumably at ~0.3x 2h-gametester-performance (1.9x * 1⁄6 ≈ 0.32x)? That doesn’t strike me as an improbable level of performance for a novice, so it’s possible that the challenge has already been met. If not, it seems likely that we’re pretty close to it.
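To make the back-of-the-envelope arithmetic explicit, here is a minimal sketch of the linear extrapolation. It assumes the learning curve is a straight line through zero (zero score at zero playtime), which is my reading of Figure 7 rather than anything the paper states; the function name `linear_extrapolate` and its parameters are purely illustrative.

```python
def linear_extrapolate(final_score: float, final_minutes: float, target_minutes: float) -> float:
    """Estimate the score at target_minutes, ASSUMING a straight line through the origin."""
    return final_score * target_minutes / final_minutes


# ~1.9x gametester performance after 2h (120min), read off the figure; target is 20min.
score_20min = linear_extrapolate(final_score=1.9, final_minutes=120, target_minutes=20)
print(f"{score_20min:.2f}x gametester performance")  # prints "0.32x gametester performance"
```

If the real curve is concave (fast early gains, then flattening), this would underestimate the 20min score; if it is convex, it would overestimate it.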