And if I pushed around symbols correctly, the geometric derivative can be pulled inside of a geometric expectation ($\nabla^*_\theta G_{x\sim P(x)}[f_\theta(x)] = G_{x\sim P(x)}[\nabla^*_\theta f_\theta(x)]$), similarly to how an additive derivative can be pulled inside an additive expectation ($\nabla_\theta E_{x\sim P(x)}[f_\theta(x)] = E_{x\sim P(x)}[\nabla_\theta f_\theta(x)]$). Also, just as additive expectation distributes over addition ($E[f(x)+g(x)] = E[f(x)] + E[g(x)]$), geometric expectation distributes over multiplication ($G[f(x)g(x)] = G[f(x)]\,G[g(x)]$).
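As a quick numerical sanity check, the multiplicativity of $G$ holds exactly on any fixed set of samples, since the empirical geometric expectation is just the geometric mean. A minimal numpy sketch (the sample distribution and the functions `f` and `g` below are arbitrary illustrative choices):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0.5, 2.0, size=10_000)  # samples from some P(x)

f = x**2 + 1.0           # two arbitrary positive functions of the samples
g = np.exp(np.sin(x))

# Empirical geometric expectation: exp of the mean of logs (the geometric mean).
G = lambda v: np.exp(np.mean(np.log(v)))

# G[f*g] equals G[f]*G[g], because logs turn products into sums.
print(G(f * g), G(f) * G(g))  # agree up to floating-point error
```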
I think what is going on here is that both $\nabla^*$ and $G$ are of the form $(e^\wedge) \circ g \circ \ln$, with $g = \nabla$ and $g = E$ respectively. Let’s define the star operator as $g^* = (e^\wedge) \circ g \circ \ln$. Then $(f \circ g)^* = (e^\wedge) \circ (f \circ g) \circ \ln = (e^\wedge) \circ f \circ \ln \circ (e^\wedge) \circ g \circ \ln = f^* \circ g^*$, by associativity of function composition and the fact that $\ln \circ (e^\wedge)$ is the identity. Further, if $f$ and $g$ commute, then so do $f^*$ and $g^*$: $g^* \circ f^* = (g \circ f)^* = (f \circ g)^* = f^* \circ g^*$.
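To make the star operator concrete, here is a small sketch of it as a higher-order Python function (the name `star` is mine, not from anything above), checking the composition law $(f \circ g)^* = f^* \circ g^*$ on a few points:

```python
import numpy as np

# star(g) = exp ∘ g ∘ log, for any map g acting on (arrays of) positive numbers.
def star(g):
    return lambda x: np.exp(g(np.log(x)))

x = np.linspace(0.5, 3.0, 7)

f = lambda t: 2.0 * t + 1.0   # two arbitrary maps to compose
g = lambda t: t**3

lhs = star(lambda t: f(g(t)))(x)   # (f ∘ g)* applied to x
rhs = star(f)(star(g)(x))          # (f* ∘ g*) applied to x
print(np.allclose(lhs, rhs))       # True: (f ∘ g)* = f* ∘ g*
```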
So the commutativity of the geometric expectation and the geometric derivative falls directly out of their representations as $E^*$ and $\nabla^*$, respectively, by the commutativity of $E$ and $\nabla$, as long as they are taken over different variables.
We can also derive what happens when the gradient and the expectation involve the same variable, i.e. $(\nabla_\theta \circ E_{x\sim P_\theta(x)})^*$, where the sampling distribution itself depends on $\theta$. First, writing $(\ast k)$ for the map $x \mapsto kx$ and $(\wedge k)$ for $x \mapsto x^k$, notice that $(\ast k)^*(x) = e^{k \ln x} = e^{\ln x^k} = x^k$, so $(\ast k)^* = (\wedge k)$. Also, $(+k)^*(x) = e^{k + \ln x} = e^k e^{\ln x} = x e^k$, so $(+k)^* = (\ast\, e^k)$.
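Continuing the same sketch, the two scalar identities can be checked directly (again with an arbitrary constant `k` and grid `x`):

```python
import numpy as np

def star(g):
    return lambda x: np.exp(g(np.log(x)))

k = 1.7
x = np.linspace(0.5, 3.0, 7)

# (*k)* = (^k): the star of "multiply by k" is "raise to the power k".
print(np.allclose(star(lambda t: k * t)(x), x**k))            # True

# (+k)* = (* e^k): the star of "add k" is "multiply by e^k".
print(np.allclose(star(lambda t: t + k)(x), np.exp(k) * x))   # True
```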
Now let’s expand the composition of the gradient and expectation: $(\nabla_\theta \circ E_{x\sim P_\theta(x)})(f(x)) = \nabla_\theta \int P_\theta(x) f(x)\, dx = \int P_\theta(x)\, f(x)\, \nabla_\theta \ln P_\theta(x)\, dx = E_{x\sim P_\theta(x)}[\nabla_\theta (f(x) \ln P_\theta(x))]$, using the log-derivative trick (and the fact that $f$ itself does not depend on $\theta$). So $\nabla_\theta \circ E_{x\sim P_\theta(x)} = E_{x\sim P_\theta(x)} \circ \nabla_\theta \circ (\ast \ln P_\theta(x))$.
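As a sanity check on the log-derivative trick, here is a hedged Monte Carlo sketch with an assumed Gaussian family $P_\theta = \mathcal{N}(\theta, 1)$ and $f(x) = x^2$, for which the exact gradient is $\nabla_\theta E[x^2] = \nabla_\theta(\theta^2 + 1) = 2\theta$:

```python
import numpy as np

theta, n = 0.7, 2_000_000
rng = np.random.default_rng(0)
x = rng.normal(theta, 1.0, size=n)   # samples from P_theta = N(theta, 1)

f = x**2
score = x - theta                    # d/dtheta ln N(x; theta, 1) = x - theta
estimate = np.mean(f * score)        # E[ d/dtheta ( f(x) * ln P_theta(x) ) ]
print(estimate, 2 * theta)           # Monte Carlo estimate vs. exact gradient
```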
(Recall that the geometric derivative is itself a starred operator, $\nabla^* = (e^\wedge) \circ \nabla \circ \ln$, which is equivalent to $\nabla^* f(x) = \exp\!\left[\frac{d}{dx} \ln f(x)\right]$.)
Therefore, $\nabla^*_\theta \circ G_{x\sim P_\theta(x)} = (\nabla_\theta \circ E_{x\sim P_\theta(x)})^* = E^*_{x\sim P_\theta(x)} \circ \nabla^*_\theta \circ (\ast \ln P_\theta(x))^* = G_{x\sim P_\theta(x)} \circ \nabla^*_\theta \circ (\wedge \ln P_\theta(x))$.
Writing it out, we have $\nabla^*_\theta\, G_{x\sim P_\theta(x)}[f(x)] = G_{x\sim P_\theta(x)}\!\left[\nabla^*_\theta\!\left(f(x)^{\ln P_\theta(x)}\right)\right]$.
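This final identity can be checked numerically on a finite-support distribution, where the geometric expectation is an exact weighted sum and the $\theta$-derivatives can be taken by finite differences. A hedged sketch, with an assumed softmax family `P(t)`, weights `w`, and positive values `f` chosen purely for illustration:

```python
import numpy as np

# Check  ∇*_θ G_{x~P_θ}[f(x)]  =  G_{x~P_θ}[ ∇*_θ ( f(x)^{ln P_θ(x)} ) ]
# on a 3-point support with P_θ(x) ∝ exp(θ * w_x).
w = np.array([0.3, -1.0, 2.0])
f = np.array([1.5, 4.0, 0.7])        # positive values of f on the support
theta, eps = 0.4, 1e-6

def P(t):
    z = np.exp(t * w)
    return z / z.sum()

def d_dtheta(h):                     # central finite difference in theta
    return (h(theta + eps) - h(theta - eps)) / (2 * eps)

# Left side: exp(d/dθ ln G[f]), with ln G[f] = Σ_x P_θ(x) ln f(x).
lnG = lambda t: np.sum(P(t) * np.log(f))
lhs = np.exp(d_dtheta(lnG))

# Right side: G over exp(d/dθ [ln P_θ(x) · ln f(x)]), taken per outcome.
inner = d_dtheta(lambda t: np.log(P(t)) * np.log(f))   # vector over outcomes
rhs = np.exp(np.sum(P(theta) * inner))                 # G[·] = exp(E[ln ·])

print(lhs, rhs)   # the two sides agree up to finite-difference error
```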