One more thing I should probably have added: I am only talking about the distributional shift in input data, which is important. But I think Eliezer is also talking about another kind of distributional shift that comes from a change in ontology. I am confused about how to think of this. Intuitively it is “the world hasn’t changed, just how I look at it”, whereas I discuss “the world has changed” (because the agent is doing things that haven’t occurred during training).
One more thing I should probably have added: I am only talking about the distributional shift in input data, which is important. But I think Eliezer is also talking about another kind of distributional shift that comes from a change in ontology. I am confused about how to think of this. Intuitively it is “the world hasn’t changed, just how I look at it”, whereas I discuss “the world has changed” (because the agent is doing things that haven’t occurred during training).