Vika comments on Selection Theorems: A Program For Understanding Agents

Vika 16 Jan 2023 12:22 UTC
LW: 6 AF: 4
AF
I like this research agenda because it provides a rigorous framing for thinking about inductive biases for agency and gives detailed and actionable advice for making progress on this problem. I think this is one of the most useful research directions in alignment foundations since it is directly applicable to ML-based AI systems.