I feel like there’s also a Bayesian NN perspective on the PBRF thing. It has some ingredients that look like a Gaussian prior (the L2 regularization term), and an update, which would have to be the combination of the first two ingredients: the negative loss on a single datapoint, together with the term that keeps the overall difference in loss small.
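For concreteness, here is a rough sketch of the objective I have in mind, written from memory rather than copied from the paper, so the notation ($\theta^\star$ for the trained parameters, $m$ for the datapoint in question, $D_\ell$ for the output-space divergence, $\lambda$ for the damping strength) is my own:

$$\theta^{\text{PBRF}} \;\approx\; \arg\min_{\theta} \;\; -\tfrac{1}{N}\,\ell\big(f_\theta(x_m),\,y_m\big) \;+\; \tfrac{1}{N}\sum_{i=1}^{N} D_\ell\big(f_\theta(x_i),\,f_{\theta^\star}(x_i)\big) \;+\; \tfrac{\lambda}{2}\,\lVert\theta - \theta^\star\rVert^2$$

If that's roughly right, the first term pushes the loss up at the single datapoint, the second keeps the network's outputs (and hence the overall loss) close to those of the trained model, and the third is the L2 piece that reads like a Gaussian prior, though centered at the current weights rather than at zero.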
Said like this, it’s obvious to me that this is way different from leave-one-out. First learning to get low loss at a datapoint and then later learning to get high loss there is not equivalent to never learning anything directly about it.