The Levin bound doesn’t apply directly to neural networks, because it assumes that P is finite and discrete, but it gives some extra backing to the intuition above.
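(For reference, a rough statement of the bound, as I understand it. Levin’s coding theorem says the universal semimeasure satisfies $m(f) = 2^{-K(f) + O(1)}$, and since pushing a uniform distribution over a finite, discrete parameter space through a computable parameter-function map $M$ yields a computable distribution, dominance of $m$ gives approximately

$$\Pr(f) \leq 2^{-K(f) + K(M) + O(1)},$$

so any function that receives non-negligible probability under random sampling of the parameters must have low Kolmogorov complexity.)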
In what sense is the parameter space of a neural network not finite and discrete? It is often useful to model floating-point values as continuous, but in fact they are discrete, so it seems that a theorem which assumes discreteness would still apply.
Yes, it does of course apply in that sense.
I guess the question then is which level of abstraction would be the most informative or useful for understanding what’s going on here. For example, we could also choose to take into account the fact that any actual computer program runs on a physical computer, which is governed by the laws of electromagnetism (in which case the parameter space might count as continuous again).
I’m not sure if accounting for the floating-point implementation is informative or not in this case.
My understanding is that floating-point granularity is enough of a real problem that it does sometimes matter in realistic ML settings, which suggests it’s a reasonable level of abstraction at which to analyze neural networks. By contrast, any additional insights from an electromagnetism-based analysis probably never matter, which suggests that is not a reasonable or useful level of abstraction.
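To make the discreteness point concrete, here is a minimal sketch (my own illustration, not from the exchange above; it assumes NumPy and IEEE-754 float32):

```python
import numpy as np

# Each float32 parameter has at most 2**32 bit patterns, so a network
# with n parameters has at most 2**(32 * n) distinct settings: the
# parameter space really is finite and discrete.

w = np.float32(1.0)

# Distance from 1.0 to the next representable float32 above it.
gap = np.nextafter(w, np.float32(np.inf)) - w
print(f"gap above 1.0 in float32: {gap:.3e}")  # ~1.192e-07

# An SGD-style update smaller than that gap is silently dropped,
# which is one way the granularity shows up in real training.
lr = np.float32(1e-4)
grad = np.float32(1e-4)
print(w + lr * grad == w)  # True: the update is below the granularity
```

So a float32 network satisfies the finiteness assumption literally; the live question is whether the bound, applied at that resolution, is the informative level of abstraction.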