davidad comments on Worst-case thinking in AI alignment

davidad 27 Dec 2021 11:45 UTC
LW: 6 AF: 2
My interpretation of the no-free-lunch (NFL) theorems is that solving the relevant problems under worst-case assumptions is too easy, to the point of being trivial: a brute-force search satisfies the criterion of worst-case optimality. With that settled, making progress requires stepping up to average-case evaluation, which is harder.
(However, I agree that once we already need to do some averaging, making the statistical assumptions explicit, stripping them down, and trying to get closer to worst-case guarantees, without making the problem trivial again, is harder than just evaluating empirically against benchmarks.)
- jsteinhardt 27 Dec 2021 16:09 UTC
  LW: 2 AF: 1
  AF Parent
  Finding the min-max solution might be easier, but what we actually care about is an acceptable solution. My point is that the min-max solution, in most cases, will be unacceptably bad.
  
  And in fact, since min_x f(theta,x) ≤ E_x[f(theta,x)], any solution that is acceptable in the worst case is also acceptable in the average case.
  - davidad 28 Dec 2021 9:02 UTC
    LW: 6 AF: 2
    AF Parent
    Agreed—although optimizing for the worst case is usually easier than optimizing for the average case, satisficing for the worst case is necessarily harder (and, in ML, typically impossible) than satisficing for the average case.
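
To make the inequality min_x f(theta,x) ≤ E_x[f(theta,x)] from this exchange concrete, here is a minimal Python sketch. Everything in it is an illustrative assumption rather than anything taken from the thread: the score function `f`, the sampled inputs `xs`, the candidate grid `thetas`, and the acceptability `threshold` are all hypothetical. It only shows that the set of solutions acceptable in the worst case is contained in the set acceptable on average, so satisficing for the worst case cannot be easier.

```python
import random

def worst_case(f, theta, xs):
    """Worst-case score of theta: the minimum of f over all inputs."""
    return min(f(theta, x) for x in xs)

def average_case(f, theta, xs):
    """Average-case score of theta: the mean of f over all inputs."""
    return sum(f(theta, x) for x in xs) / len(xs)

def f(theta, x):
    # Hypothetical score (higher is better): negative squared error.
    return -(theta - x) ** 2

random.seed(0)
# Assumed input distribution: uniform on [-1, 1].
xs = [random.uniform(-1.0, 1.0) for _ in range(1000)]
# Assumed candidate solutions: a coarse grid on [-1, 1].
thetas = [i / 10 for i in range(-10, 11)]
# Assumed bar for an "acceptable" score.
threshold = -0.5

# Candidates that clear the bar under each criterion.
worst_ok = [t for t in thetas if worst_case(f, t, xs) >= threshold]
avg_ok = [t for t in thetas if average_case(f, t, xs) >= threshold]

print("acceptable in the worst case:", worst_ok)
print("acceptable on average:       ", avg_ok)
```

Because the minimum never exceeds the mean, every entry of `worst_ok` must also appear in `avg_ok`. With these particular assumptions one would expect `worst_ok` to be much smaller than `avg_ok`, possibly empty, which is the sense in which worst-case satisficing can be out of reach even when average-case satisficing is not.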