I worry that using $O^D$ as the space of behaviors misses something important about the intuitive idea of robustness, making any conclusions about $F$ or $O^D$ or behavior manifolds harder to apply. A more natural space (to illustrate my point, not as something helpful for this post) would be $O^{\mathbb{R}^n}$, with a metric that cares about how outputs differ on inputs that fall within a particular base distribution $\gamma$, something like $d(g,h) = \mathbb{E}_{x \sim \gamma}\,|g(x) - h(x)|$.
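(A minimal sketch of what I mean, estimating this distance by Monte Carlo; this assumes numpy, scalar outputs, and a user-supplied sampler for $\gamma$, and the names are purely illustrative:)

```python
import numpy as np

def behavior_distance(g, h, sample_gamma, n_samples=10_000, seed=0):
    """Monte Carlo estimate of d(g, h) = E_{x ~ gamma} |g(x) - h(x)|."""
    rng = np.random.default_rng(seed)
    xs = sample_gamma(rng, n_samples)      # draw inputs from the base distribution gamma
    # assumes scalar outputs; for vector outputs, replace |.| with a norm
    return np.mean(np.abs(g(xs) - h(xs)))  # average pointwise output gap
```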
The issue with $O^D$ is that models in a behavior manifold only need to agree on the training inputs, so the manifold always includes models with arbitrarily crazy behavior on every input outside the dataset, even inputs very close to those in the dataset (which is what $\gamma$ above is supposed to prevent). So the behavior manifolds are more like cylinders than balls, ignoring crucial dimensions; a toy illustration follows below. And since generalization does work in practice, learning must tend to find very atypical points of these manifolds, so it is generally unclear how a behavior manifold as a whole is going to be relevant to what is actually going on.
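(To make the cylinder point concrete, here is a toy example continuing the hypothetical `behavior_distance` sketch above, with a 1-D input space: two models that agree exactly on every point of the dataset $D$, and hence lie on the same behavior manifold, can still be far apart under $d$.)

```python
# Two models on the same behavior manifold for D = {0, 1, ..., 9}:
# they agree exactly on every training input, but differ in between.
D = np.arange(10)

g = lambda x: 0.5 * x                        # a tame model
h = lambda x: 0.5 * x + np.sin(np.pi * x)    # equals g on D, oscillates off it

assert np.allclose(g(D), h(D))               # identical behavior on the dataset

# Under a base distribution gamma = Uniform(0, 9) the gap is large:
uniform_gamma = lambda rng, n: rng.uniform(0.0, 9.0, size=n)
print(behavior_distance(g, h, uniform_gamma))  # ~0.64 (2/pi in the limit)
```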
I agree that the space $O^D$ may well miss important concepts and perspectives. As I say, it is not my suggestion to look at it, but rather just something that was implicitly being done in another post. The space $O^{\mathbb{R}^n}$ may well be a more natural one. (It's of course the space of functions $\mathbb{R}^n \to O$, and so a space in which 'model space' naturally sits, in some sense.)