I don’t know the answer to your actual question, but I’ll note that there are slightly fewer mech interp mentors than mentors listed in the “AI interpretability” area (though all of them are at least working on “model internals”). I’d say Stephen Casper and I aren’t focused on interpretability in any narrow sense, and Nandi Schoots’ projects also sound closer to science of deep learning than mech interp. Assuming we count everyone else, that leaves 11 out of 39 mentors (about 28%), a slightly smaller share than the ~8 out of 23 (about 35%) from the previous cohort (though maybe not by much).