Ahh, I think I did not think through what “rationality enhancement” might mean; perhaps my own recent search and the AI context of Yudkowsky’s original intent skewed me a little. I was thinking of something like “understanding and applying concepts of rationality” in a way that might include “anticipating misaligned AI” or “anticipating AI-human feedback responses”.
I like the way you’ve framed what’s probably the useful question. I’ll need to think about that a bit more.