I’m really excited about this program! Super curious to see what comes out of it—I expect I’ll learn a lot whether it goes well, or struggles to get traction. And I want to see more of this kind of ambitious scalable alignment effort!
If you’re interested in getting into mechanistic interpretability work, you should definitely apply to it
I’m really excited about this program! Super curious to see what comes out of it—I expect I’ll learn a lot whether it goes well, or struggles to get traction. And I want to see more of this kind of ambitious scalable alignment effort!
If you’re interested in getting into mechanistic interpretability work, you should definitely apply to it