I think GPT-N is definitely not aligned, for mesa-optimizer reasons. It’ll be some unholy being with a superhuman understanding of all the different types of humans, all the different parts of the internet, all the different kinds of content and style… but it won’t itself be human, or anything close.
Of course, it’s also not outer-aligned in Evan’s sense, because the universal prior is malign, etc.