My understanding is that CDT explicitly disallows acausal predictions—so it disallows models which update on future agent actions themselves, which is important for one boxing.
Action
Box Empty
Box Full
one_box
disallowed
allowed
two_box
allowed
disallowed
In EDT/AIXI the world model is allowed to update the hidden box state conditional on the action chosen, even though this is ‘acausal’. Its equivalent to simply correctly observing that the agent will get higher reward in the subset of the multiverse where the agent decides to one boxe.
The actions sui generis being “inputs to the prediction models” does not distinguish CDT from EDT.
(To be continued, leaving now.)
My understanding is that CDT explicitly disallows acausal predictions—so it disallows models which update on future agent actions themselves, which is important for one boxing.
In EDT/AIXI the world model is allowed to update the hidden box state conditional on the action chosen, even though this is ‘acausal’. Its equivalent to simply correctly observing that the agent will get higher reward in the subset of the multiverse where the agent decides to one boxe.