If your proof works, I would expect that Omega also knows the agent is consistent and can follow the same logic, so the UDT agent two-boxes on Newcomb’s problem, unless you use a version of UDT that (effectively) optimizes over decisions rather than actions (like TDT), which would solve both problems.
EDIT: On solving both problems: my understanding of UDT comes from AlephNeil’s post. His “generalization 2” is exactly what I mean by a problem where you need to optimize over decisions rather than actions, and he claims (five diagrams later) that a UDT agent does so, i.e., treats its action as also “controlling telepathic robots.”
So going by that understanding of UDT, cousin_it’s proof is incorrect. If we can find proofs shorter than N, we can treat the predictor as a “robot,” and then two-boxing is regarded as worse than one-boxing if it “controls the robot” into not filling the box. So to predict the agent’s action we probably need to fully define the problem: what does the predictor do when no proofs are found, and when the agent’s action depends on the predictor’s?
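To make the action-level vs. decision-level distinction concrete, here is a toy sketch of my own (it assumes a predictor modelled as simply copying the agent’s decision; none of the names or payoffs come from cousin_it’s or AlephNeil’s posts):

```python
def utility(action, prediction):
    """Newcomb payoff: the opaque box holds $1M iff one-boxing was predicted."""
    big = 1_000_000 if prediction == "one-box" else 0
    small = 1_000 if action == "two-box" else 0
    return big + small

# Optimizing over actions: the prediction is held fixed, so two-boxing
# dominates whatever the predictor did.
for fixed_prediction in ("one-box", "two-box"):
    best_action = max(("one-box", "two-box"),
                      key=lambda a: utility(a, fixed_prediction))
    print(fixed_prediction, "->", best_action)   # two-box in both cases

# Optimizing over decisions: the predictor is treated as a "robot" that
# copies the decision, so the decision also sets the prediction.
best_decision = max(("one-box", "two-box"),
                    key=lambda d: utility(d, d))
print("decision-level optimum:", best_decision)  # one-box
```

The only point of the sketch is that the decision-level optimizer counts the predictor’s output as part of what it is choosing, which is what “controlling the robot” cashes out to here.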
Omega knows everything, but unfortunately he isn’t available right now. We are stuck with a far more limited predictor.
The reference to Omega comes from contrasting this post with prior claims that a UDT agent will one-box on Newcomb’s problem.