Vanessa Kosoy comments on [missing post]

Vanessa Kosoy 15 Jul 2024 5:56 UTC
2 points
Let’s view each accessible action space $A(s)$ as the set of randomized policies over $V(A(s))$.
It seems worth clarifying that this representation is non-unique: multiple distributions over $V(A)$ can correspond to the same point in $A$.
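
A minimal illustration, assuming $A$ is a convex polytope and $V(A)$ is its vertex set (my reading of the notation; the quoted text does not say so explicitly). Take the hypothetical case where $A$ is the unit square, so $V(A) = \{(0,0), (1,0), (0,1), (1,1)\}$. The center of the square is then the mean of at least three different distributions over $V(A)$:

$$\left(\tfrac12, \tfrac12\right) = \tfrac12 (0,0) + \tfrac12 (1,1) = \tfrac12 (1,0) + \tfrac12 (0,1) = \tfrac14 (0,0) + \tfrac14 (1,0) + \tfrac14 (0,1) + \tfrac14 (1,1).$$

Under the same assumption, the representation is unique only when the vertices are affinely independent, that is, when $A$ is a simplex.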