Garrett Baker comments on On Frequentism and Bayesian Dogma

Garrett Baker 18 Oct 2023 5:34 UTC
2 points
0
From another comment of mine:

The prior assigns uniform probability to all weights, and I believe a good understanding of the mapping from weights to functions is unknown, though lots of the time there are many directions you can move in in the weight space which don’t change your function, so one would expect its a relatively compressive mapping (in contrast to, say, a polynomial parameterization, where the mapping is one-to-one).

Also, side-comment: Thanks for the discussion! Its fun.

EDIT: Actually, there should be a term for the stochasticity which you integrate into the SLT equations like you would temperature in a physical system. I don’t remember exactly how this works though. Or if its even known the exact connection with SGD.