To what extent is preference over trajectories indistinguishable from preference over future states including a memory of the trajectory?
If the implementation of the memory is airtight (e.g. the memory is perfect and cannot be hacked into), then I would say “an AI with a preference over future states including a memory of the trajectory” is an implementation approach for building an AI with a preference over trajectories. It’s probably not a good implementation approach in practice, but it is an implementation method in principle. :-P
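To make the "in principle" equivalence concrete, here is a minimal toy sketch. Everything in it is an illustrative assumption rather than anything from a real system: the names `AugmentedState`, `step`, `lift_to_state_utility`, and the particular `trajectory_utility` are all hypothetical. It assumes the memory is perfect and append-only, which is exactly the "airtight" condition above.

```python
from dataclasses import dataclass
from typing import Callable, Tuple

State = int
Trajectory = Tuple[State, ...]


@dataclass(frozen=True)
class AugmentedState:
    """A world state bundled with a perfect, tamper-proof memory (assumed airtight)."""
    current: State
    memory: Trajectory  # every state visited so far, including `current`


def step(aug: AugmentedState, next_state: State) -> AugmentedState:
    # The memory is append-only: nothing is ever forgotten or rewritten.
    return AugmentedState(current=next_state, memory=aug.memory + (next_state,))


def run(states: Trajectory) -> AugmentedState:
    # Walk through a sequence of states, recording each one in memory.
    aug = AugmentedState(current=states[0], memory=(states[0],))
    for s in states[1:]:
        aug = step(aug, s)
    return aug


def trajectory_utility(traj: Trajectory) -> float:
    """Toy trajectory preference: reward a high final state, penalize ever visiting 0."""
    penalty = 10.0 if 0 in traj else 0.0
    return float(traj[-1]) - penalty


def lift_to_state_utility(
    u: Callable[[Trajectory], float],
) -> Callable[[AugmentedState], float]:
    # A preference over (augmented) future states that just reads the memory.
    return lambda aug: u(aug.memory)


state_utility = lift_to_state_utility(trajectory_utility)

path_a: Trajectory = (3, 0, 5)  # passes through the penalized state 0
path_b: Trajectory = (3, 4, 5)  # same start and end, avoids state 0

end_a, end_b = run(path_a), run(path_b)
print(state_utility(end_a), state_utility(end_b))
```

Both paths end in the same world state, yet the state-based utility tells them apart, because the memory is part of the state being evaluated. If the memory could be edited or were lossy, the two kinds of preference would come apart.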
More in this comment.