Hi jylin04. Fantastic post! It touches on many more aspects of interpretability than my post about the book. I also enjoyed your summary PDF!
I’d love to contribute to any theory work in this direction, if I can. Right now I’m stuck around p. 93 of the book. (I’ve read everything, but I’m now trying to re-derive the equations and have trouble figuring out where a certain term goes. I am also building a Mathematica package that takes care of some of the more tedious parts of the calculations.) Maybe we could get in touch?
Hi jylin04. Fantastic post! It touches on many more aspects of interpretability than my post about the book. I also enjoyed your summary PDF!
I’d love to contribute to any theory work in this direction, if I can. Right now I’m stuck around p. 93 of the book. (I’ve read everything, but I’m now trying to re-derive the equations and have trouble figuring out where a certain term goes. I am also building a Mathematica package that takes care of some of the more tedious parts of the calculations.) Maybe we could get in touch?