Error

LW server reports: not allowed.

This probably means the post has been deleted or moved back to the author's drafts.

Vanessa Kosoy 15 Jul 2024 6:58 UTC
2 points
0
let $k_{s^{'}} : P_{j} \to R^{U_{⊤}}$ be a helper function that maps each $a \in V (P_{i})$ to $e_{⊤} (s^{'}, a)$ .
This function is ill-defined outside the vertices.
Vanessa Kosoy 15 Jul 2024 6:37 UTC
2 points
0
or $s := (s^{'}, s^{''}) \in S_{\land}$ , we want $P_{\land} (s) := P_{σ_{⊤} (s^{'})} (s^{''})$ and $A_{\land} (s) := A_{σ_{⊤} (s^{'})} (s^{''})$ , so that the actions available are just those of the state in the sub-environment. To achieve this we define $σ_{\land} (s) := σ_{σ_{⊤} (s^{'})} (s^{''})$
It seems that you’re using Ai and Pi to denote both the action spaces of the top environments and the action space assignment functions of the bottom environments. In addition, there is an implicit assumption that the bottom environments share the same list of action spaces. This is pretty confusing.
- Arjun Pitchanathan 30 Jul 2024 17:28 UTC
  1 point
  0
  Parent
  I’m not. I guess this is the part that makes it confusing
  for readability we define $A (s) := A_{σ (s)}$ and $P_{σ (s)}$ to be the accessible and outer action spaces of $s$ respectively
  Do you have a suggestion for alternate notation? I use this because we often need to refer to the action space corresponding to a state. I think this would be needed even with the language framing.
  (I also assigned $j := σ (s^{'})$ to make it more readable)
  - Vanessa Kosoy 3 Aug 2024 11:18 UTC
    2 points
    0
    Parent
    You can use e.g. subscripts to refer to indices of the action space list and superscripts to refer to indices of the subenvironment list.
    - Arjun Pitchanathan 6 Aug 2024 19:43 UTC
      1 point
      0
      Parent
      I don’t think this will work because we are already using subscripts to denote which environment’s list we are referring to
      - Vanessa Kosoy 7 Aug 2024 7:06 UTC
        2 points
        0
        Parent
        Yes, my point is that currently subscripts refer to both subenvironments and entries in the action space list. I suggest changing one of these two into superscripts.
Vanessa Kosoy 15 Jul 2024 6:29 UTC
2 points
0
such that $f_{i} ((1 - γ_{⊥}) Q_{i}) = A (E_{i})$
Is A(Ei) supposed to be just Ai?
Vanessa Kosoy 15 Jul 2024 6:17 UTC
2 points
0
μ×:=μ1×⋯×μk×δ
Unclear what delta is here. Is it supposed to be p?
Vanessa Kosoy 15 Jul 2024 6:06 UTC
2 points
0
An atomic environment is constructed by directly providing
The transition kernel is missing from this list.
Vanessa Kosoy 15 Jul 2024 6:04 UTC
2 points
0
- a vector space $W$ and linear maps $g : R^{U} \to W$ and $R^{Q} : W \to R$ such that for any $q \in Q$ , $R^{Q} (q) = {max}_{ν \in M s.t. g (ν) = q} R^{ν} (ν)$ .
- a H-polytope $Q := g (M) \subseteq W$ that we call the occupancy polytope
Confusing: you’re using Q before you defined it. Also, instead of writing “s.t.” in the subscript, you can write “:”
Vanessa Kosoy 15 Jul 2024 5:56 UTC
2 points
0
Let’s view each accessible action space $A (s)$ as the set of randomized policies over $V (A (s))$ .
Seems worth to clarify that this representation is non-unique: multiple distribution over V(A) can correspond to the same point in A.
Vanessa Kosoy 15 Jul 2024 5:50 UTC
2 points
1
where each $A_{i}$ and $P_{i}$ is an HV-polytope
Too restrictive. P can be an H-polytope, doesn’t need to be an HV-polytope.
Vanessa Kosoy 15 Jul 2024 5:35 UTC
2 points
0
efficiently^[1]
The footnote is missing