But if you are attempting to convey the plausibility of inner misalignment, or a mental model of inner misalignment, why not choose instead to analogize the situation to within-lifetime learning among humans?
This prompted me to think about this analogy for a few hours and write down my thoughts here. I'd be interested to know if you have any comments on my comments.
Also, I think this serves as a positive example of arguing by analogy: it shows that it's possible to make intellectual progress this way, going from evolution-as-alignment to within-lifetime-learning-as-alignment to my latest analogy (in the above comment) of evolution-as-alignment-researcher, with each perhaps contributing to a better overall understanding.