P(xi) is a polytope with P(xi)⊆ΔA, corresponding to allowed action distributions at that state.
I think it’s mathematically cleaner to get rid of A and have those be abstract polytopes.
Sounds good
I think it’s mathematically cleaner to get rid of A and have those be abstract polytopes.
Sounds good