We’re working on the theta problem at the moment. Basically we’re currently defining interruptibility in terms of convergence to optimality. Hence we need the agent to explore sufficiently, hence we can’t set theta=1. But we want to be able to interrupt the agent in practice, so we want theta to tend to one.
Yup, I think I understand that, and agree you need to at least tend to one. I’m just wondering why you initially use the loser definition of theta (where it doesn’t need to tend to one, and can instead be just 0 )
We’re working on the theta problem at the moment. Basically we’re currently defining interruptibility in terms of convergence to optimality. Hence we need the agent to explore sufficiently, hence we can’t set theta=1. But we want to be able to interrupt the agent in practice, so we want theta to tend to one.
Yup, I think I understand that, and agree you need to at least tend to one. I’m just wondering why you initially use the loser definition of theta (where it doesn’t need to tend to one, and can instead be just 0 )
When defining safe interruptibility, we let theta tend to 1. We probably didn’t specify that earlier, when we were just introducing the concept?