This asymmetry makes a lot of sense from an efficiency standpoint. No sense wasting your limited storage/​computation on state(-action pair)s that you are also simultaneously preventing yourself from encountering.
This asymmetry makes a lot of sense from an efficiency standpoint. No sense wasting your limited storage/​computation on state(-action pair)s that you are also simultaneously preventing yourself from encountering.