Rohin Shah comments on Techniques for optimizing worst-case performance

Rohin Shah 30 Jul 2019 23:08 UTC
LW: 2 AF: 1
AF
That’s fair. I suspect I was assuming a really fast model. Are you imagining that both the model and the specification are quite fast, or that both are relatively slow?
I think my intuition is something like: the number of queries to the model / specification required to obtain worst-case guarantees is orders of magnitude more than the number of queries needed to train the model, and this ratio gets worse the more complex your environment is. (Though if the training distribution is large enough such that it “covers everything the environment can do”, the intuition becomes weaker, mostly because this makes the cost of training much higher.)
- paulfchristiano 31 Jul 2019 15:47 UTC
  LW: 4 AF: 2
  AF Parent
  the number of queries to the model / specification required to obtain worst-case guarantees is orders of magnitude more than the number of queries needed to train the model, and this ratio gets worse the more complex your environment is
  Not clear to me whether this is true in general—if the property you are specifying is in some sense “easy” to satisfy (e.g. it holds of the random model, holds for some model near any given model), and the behavior you are training is “hard” (e.g. requires almost all of the model’s capacity) then it seems possible that verification won’t add too much.
  - Rohin Shah 31 Jul 2019 16:44 UTC
    LW: 4 AF: 2
    AF Parent
    Yeah, I agree it is not always true and so isn’t a clear obstruction.