I research intelligence and its emergence and expression in neural networks to ensure advanced AI is safe and beneficial.
Current interests: neural network interpretability, alignment/safety, unsupervised learning, and deep learning theory.
For more, check out my scholar profile and personal website.
Sorry, fixed! They should match now; I’d forgotten to update the figure in this post. Thanks for pointing it out.