I research intelligence and its emergence and expression in neural networks to ensure advanced AI is safe and beneficial.
Current interests: neural network interpretability, alignment/safety, unsupervised learning, and deep learning theory.
For more, check out my scholar profile and personal website.
Thanks! Yep, makes sense. That’s one of the things we’ll be working on, and we hope to share some results soon!