Later in the post, I proposed a similar modification:
I think we should modify the simplified hand-off procedure I described above so that, during training:
A range of handoff thresholds and proportions p are drawn; in particular, there should be a reasonable probability of drawing p values close to 0, close to 1, and also 0 and 1 exactly.
The human net runs for pn steps before calling the reporter.
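One way the sampling step above might look in code is sketched below. This is only an illustration under stated assumptions: the mixture weights, the choice of a U-shaped Beta distribution, the function names, and the reading of n as the total number of steps available to the human net are all hypothetical and not taken from the post.

```python
import random


def sample_handoff_proportion(rng, p_exact=0.1, beta_shape=0.5):
    """Draw p in [0, 1] with point masses at 0 and 1 and extra density near both ends.

    Assumed scheme (not from the post): with probability p_exact return exactly 0,
    with probability p_exact return exactly 1, otherwise draw from Beta(a, a)
    with a < 1, which is U-shaped and so favours values close to 0 or close to 1.
    """
    u = rng.random()
    if u < p_exact:
        return 0.0
    if u < 2 * p_exact:
        return 1.0
    return rng.betavariate(beta_shape, beta_shape)


def handoff_step(p, n):
    """Number of steps the human net runs before the reporter is called.

    Assumes n is the total step budget, so the handoff happens after round(p * n) steps.
    """
    return round(p * n)


if __name__ == "__main__":
    rng = random.Random(0)
    n = 20  # hypothetical step budget
    for _ in range(5):
        p = sample_handoff_proportion(rng)
        print(f"p = {p:.3f}, human net runs {handoff_step(p, n)} of {n} steps")
```

With p = 0 the reporter is called immediately, and with p = 1 the human net runs for its full budget first, so both endpoints are exercised during training alongside the near-0 and near-1 cases.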