I research intelligence and it’s emergence and expression in neural networks to ensure advanced AI is safe and beneficial.
Current interests: neural network interpretability, alignment/safety, unsupervised learning, and deep learning theory.
For more, check out my scholar profile and personal website.