We can define the world as having the Markov property, i.e. as a Markov process. But when we split the world into an agent and its environment, we lose the Markov property for each of them separately.
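As a minimal sketch of this loss of the Markov property, consider a toy world invented purely for illustration: the agent and the environment each hold one bit, and at every step they swap bits. The names `step`, `env_trajectory`, and `prob` are hypothetical helpers, and the uniform distribution over initial world states is an assumption. The joint world is Markov (it is even deterministic), yet the environment's own bit sequence is not: knowing its previous value changes the prediction of its next value.

```python
from fractions import Fraction
from itertools import product


def step(world):
    """One joint update of the toy world: agent and environment swap bits.

    The next world state depends only on the current one, so the joint
    process is Markov.
    """
    agent, env = world
    return (env, agent)


def env_trajectory(world, steps):
    """Return the environment's bit at times 0 .. steps-1."""
    traj = []
    for _ in range(steps):
        traj.append(world[1])
        world = step(world)
    return tuple(traj)


# Assumption: the initial world state is uniform over the four (agent, env) pairs.
trajectories = [env_trajectory(w, 3) for w in product((0, 1), repeat=2)]


def prob(event, given):
    """Exact conditional probability under the uniform initial distribution."""
    matching = [t for t in trajectories if given(t)]
    return Fraction(sum(1 for t in matching if event(t)), len(matching))


# Predict the environment's bit at time 2 from its bit at time 1 alone.
p_given_current = prob(lambda t: t[2] == 1, lambda t: t[1] == 1)

# Same prediction, but also conditioning on the environment's bit at time 0.
p_given_history = prob(lambda t: t[2] == 1, lambda t: t[1] == 1 and t[0] == 1)

print(p_given_current, p_given_history)
```

If the environment were a Markov process on its own, the two conditional probabilities would be equal. In this toy world they differ, because the environment's past leaks through the agent's state.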
I’m using non-standard notation and terminology because the theory I’m developing in these posts needs them. In future posts I’ll try to link more to the handful of researchers who publish on this theory. I have already published one post relating my terminology to more standard research.
I explained this in my non-standard introduction to reinforcement learning.