rotatingpaguro comments on Found Paper: “FDT in an evolutionary environment”

rotatingpaguro 27 Nov 2023 21:23 UTC
1 point
0
The actions are inferred from the argmax, but they are also inputs to the prediction models.
The actions sui generis being “inputs to the prediction models” does not distinguish CDT from EDT.
(To be continued, leaving now.)
- jacob_cannell 27 Nov 2023 21:57 UTC
  2 points
  0
  Parent
  My understanding is that CDT explicitly disallows acausal predictions—so it disallows models which update on future agent actions themselves, which is important for one boxing.
  
  Action Box Empty Box Full
  
  one_box disallowed allowed
  
  two_box allowed disallowed
  
  In EDT/AIXI the world model is allowed to update the hidden box state conditional on the action chosen, even though this is ‘acausal’. Its equivalent to simply correctly observing that the agent will get higher reward in the subset of the multiverse where the agent decides to one boxe.