wedrifid comments on Newcomb’s Problem and Regret of Rationality

wedrifid 17 May 2011 12:54 UTC
−1 points

Doesn’t have a name as far as I know. But I’m not sure it deserves one; would CDT really be a probable output anywhere besides a verbal theory advocated by human philosophers in our own Everett branch? Maybe, now that I think about it, but even so, does it matter?

It is useful to separate in one’s mind the difference between on one hand being able to One Box and cooperate in PD with agents that you know well (shared source code) and on the other hand not firing on Baby Eaters after they have already chosen not to fire on you. This is especially the case when first grappling the subject. (Could you confirm, by the way, that Akon’s decision in that particular paragraph or two is approximately what TDT would suggest?)

The above is particularly relevant because the “have access to each other’s source code” is such a useful intuition pump when grappling with or explaining the solutions to many of the relevant decision problems. It is useful to be able to draw a line on just how far the source code metaphor can take you.

There is also something distasteful about making comparisons to a decision theory that isn’t even implicitly stable under self modification. A CDT agent will change to CDT++ unless there is an additional flaw in the agent beyond the poor decision making strategy. If I create a CDT agent, give it time to think and then give it Newcomb’s problem it will One Box (and also no longer be a CDT agent). It is the errors in the agent that still remain after that time that need TDT or UDT to fix.

But it will calculate that expected value using CDT!expectation, meaning that it won’t see how self-modifying to be a timeless decision theorist could possibly affect what’s already in the box, etcetera.

*nod* This is just the ‘new rules starting now’ option. What the CDT agent does when it wakes up in an empty, boring room and does some introspection.