Quadratic Reciprocity comments on The Engineer’s Interpretability Sequence (EIS) I: Intro

Quadratic Reciprocity 4 May 2023 23:41 UTC
2 points
0
I think it’s plausible that too much effort is going to interp at the margin
What’s the counterfactual? Do you think newer people interested in AI safety should be doing other things instead of for example attempting one of the 200+ MI problems suggested by Neel Nanda? What other things?