The argument that we can focus only on the training data assumes that the AI system is not going to generalize well outside of the training dataset.
I’m not intending to make this assumption. The claim is: parts of your model that exhibit intelligence need to do something on the training distribution, because “optimize to perform well on the training distribution” is the only mechanism that makes the model intelligent.
That makes sense, and on rereading the post the transparency section is clearer now, thanks! If I had to guess what gave me the wrong impression before, it would be this part:
its behavior can only be intelligent when it is exercised on the training distribution
I suspect when I read this, I thought it implied “when it is not on the training distribution, its behavior cannot be intelligent”.
I also had trouble understanding that sub-clause. Maybe we read it in our heads with the wrong emphasis:
Meaning: The agent gets inputs that are within the training distribution ↔ the agent behaves intelligently.
But I guess it’s supposed to be:
Meaning: A behaviour is intelligent ↔ the behaviour was exercised during training on the training distribution.
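To make the two parses explicit, here is one way they could be formalized in first-order notation (the predicate names InDist, BehavesIntelligently, Intelligent, and Exercised are mine, chosen just for illustration, not from the post):

Misreading, quantifying over inputs $x$:
$$\forall x:\ \mathrm{InDist}(x) \leftrightarrow \mathrm{BehavesIntelligently}(x)$$

Intended reading, quantifying over behaviours $b$:
$$\forall b:\ \mathrm{Intelligent}(b) \leftrightarrow \mathrm{Exercised}(b, \mathcal{D}_{\mathrm{train}})$$

On the first parse, intelligence is tied to whether the current input happens to be in-distribution; on the second, a behaviour counts as intelligent in virtue of having been exercised and optimized on the training distribution, whatever input it later receives.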