Consider the following decision problem, which I call the “UDT anti-Newcomb problem”. Omega is putting money into boxes by the usual algorithm, with one exception: it isn’t simulating the player at all. Instead, it simulates what a UDT agent would do in the player’s place.
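To make the setup concrete, here is a minimal sketch of the payoff structure. It assumes the standard Newcomb amounts ($1,000 in the transparent box, $1,000,000 in the opaque box) and that the "usual algorithm" means the opaque box is filled exactly when the simulated agent one-boxes. The names `payoff`, `SMALL` and `BIG` are mine, purely for illustration.

```python
# Assumed amounts from the standard Newcomb setup (not stated in the post).
SMALL = 1_000      # transparent box, always present
BIG = 1_000_000    # opaque box, filled only if the simulated agent one-boxes

CHOICES = ("one-box", "two-box")


def payoff(player_choice: str, simulated_udt_choice: str) -> int:
    """Money the player receives in the anti-Newcomb variant.

    Omega fills the opaque box based on what the simulated UDT agent
    would do, not on what the actual player does.
    """
    if player_choice not in CHOICES or simulated_udt_choice not in CHOICES:
        raise ValueError("choices must be 'one-box' or 'two-box'")
    opaque = BIG if simulated_udt_choice == "one-box" else 0
    transparent = SMALL if player_choice == "two-box" else 0
    return opaque + transparent


if __name__ == "__main__":
    # Print the full table: rows are the simulated UDT choice,
    # columns are the actual player's choice.
    for udt in CHOICES:
        for player in CHOICES:
            print(f"UDT sim {udt:8} | player {player:8} -> {payoff(player, udt)}")
```

Under these assumptions, for either choice the simulated UDT agent makes, a player who two-boxes receives $1,000 more than a player who one-boxes, because the player's own choice no longer affects what is in the opaque box. A UDT player, however, is the one agent whose choice does match the simulation.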
This was one of my problematic problems for TDT. I also discussed some Sneaky Strategies which could allow TDT, UDT or similar agents to beat the problem.
Hi drnickbone, thanks for pointing this out! I've added links to your posts.