Apollo Research (London). My main research interests are mechanistic interpretability and inner alignment.