Shameless self-plug: Similarly, if anyone wants to discuss automating alignment research, I’m in the process of building an organization to make that happen. I’m reaching out to Logan because I have a project in mind regarding automating interpretability research (e.g. making AIs run experiments that try to make DL models more interpretable), and he’s my friend! My goal for the org is to turn it into a three-year moonshot to solve alignment. I’d be happy to chat with anyone who would be interested in chatting further about this (I’m currently testing fit with potential co-founders and seeking a cracked basement CTO).
I sent an invite, Logan! :)
Shameless self-plug: Similarly, if anyone wants to discuss automating alignment research, I’m in the process of building an organization to make that happen. I’m reaching out to Logan because I have a project in mind regarding automating interpretability research (e.g. making AIs run experiments that try to make DL models more interpretable), and he’s my friend! My goal for the org is to turn it into a three-year moonshot to solve alignment. I’d be happy to chat with anyone who would be interested in chatting further about this (I’m currently testing fit with potential co-founders and seeking a cracked basement CTO).