Sho and I want to thank jylin04 for this really nice post and endorse the distillation of our key results in her 8-page summary. We also agree that it would be interesting to make further connections between our work—in particular the effective theory framework—and interpretability, and we’d be really glad to explore and discuss that further.
Sho and I want to thank jylin04 for this really nice post and endorse the distillation of our key results in her 8-page summary. We also agree that it would be interesting to make further connections between our work—in particular the effective theory framework—and interpretability, and we’d be really glad to explore and discuss that further.