Quintin Pope comments on Deep Learning Systems Are Not Less Interpretable Than Logic/Probability/Etc

Quintin Pope 4 Jun 2022 7:00 UTC
22 points
0
Interesting. Do you know if such approaches have scales to match current SOTA models? My guess would be that, if you had a decision tree that approximated e.g., GPT-3, that it wouldn’t be very interpretable either.

Of course, you could look at any give decision and backtrace it through the tree, but I think it would still be very difficult to, say, predict what the tree will do in novel circumstances without actually running the tree. And you’d have next to no idea what the tree would do in something like a chain of thought style execution where the tree solves a problem step by step, feeding intermediate token predictions back into itself until it produced an answer.

Also, it looks like diagram (a) is actually an approximated random forest, not a neural net.
- Darmani 4 Jun 2022 10:36 UTC
  7 points
  Parent
  I do not. I mostly know of this field from conversations with people in my lab who work in this area, including Osbert Bastani. (I’m more on the pure programming-languages side, not an AI guy.) Those conversations kinda died during COVID when no-one was going into the office, plus the people working in this area moved onto their faculty positions.
  
  I think being able to backtrace through a tree counts as victory, at least in comparison to neural nets. You can make a similar criticism about any large software system.
  
  You’re right about the random forest there; I goofed there. Luckily, I also happen to know of another Osbert paper, and this one does indeed do a similar trick for neural nets (specifically for reinforcement learning); https://proceedings.neurips.cc/paper/2018/file/e6d8545daa42d5ced125a4bf747b3688-Paper.pdf
- johnswentworth 4 Jun 2022 14:13 UTC
  6 points
  Parent
  I endorse this answer.

Quintin Pope comments on Deep Learning Systems Are Not Less Interpretable Than Logic/​Probability/​Etc

Quintin Pope comments on Deep Learning Systems Are Not Less Interpretable Than Logic/Probability/Etc