rwallace comments on Ingredients of Timeless Decision Theory

rwallace 19 Aug 2009 17:48 UTC
2 points
0
By hypothesis, Omega on examining your code at 6:59, knows that you will self-modify at 7:00 and one-box thereafter.

Consider that every TDT agent must be derived from a non-TDT agent. There is no difference in principle between “I used to adhere to CDT but self-modified to TDT” and “I didn’t understand TDT when I was a child, but I follow it now as an adult”.

Correction made, thanks to Tim Tyler.
- Eliezer Yudkowsky 22 Aug 2009 21:03 UTC
  1 point
  0
  Parent
  
  By hypothesis, Omega on examining your code at 6:59, knows that you will self-modify at 7:00 and one-box thereafter.
  
  CDT agents don’t care. They aren’t causing Omega to fill box B by changing their source code at 7pm, so they have no reason to change their source code in a way that takes only one box. The source code change only causes Omega to fill box B if Omega looks at their source code after 7pm. That is how CDT agents (unwisely) compute “causes”.
  - rwallace 23 Aug 2009 9:17 UTC
    0 points
    0
    Parent
    Yes, but the CDT agent at seven o’clock is not being asked to choose one or two boxes. It has to choose between rewriting its algorithm to plain TDT (or DBDT or some variant that will one box), or to TDT with an exception clause “but use the old algorithm if you find out Omega’s prediction was made before seven o’clock”. Even by straight CDT, there is no motive for writing that exception.
    - Eliezer Yudkowsky 23 Aug 2009 19:34 UTC
      5 points
      0
      Parent
      
      Even by straight CDT, there is no motive for writing that exception.
      
      This is the point at which I say “Wrong” and “Read the literature”. I’m not sure how I can explain this any more clearly than I have already, barring a full-fledged sequence. At 7pm the CDT agent calculates that if it modifies its source to use the old algorithm in cases where Omega saw the code before 7pm, it will get an extra thousand dollars on Newcomb’s Problem, since it will take box A which contains an additional thousand dollars, and since its decision to modify its code at 7pm has no effect on an Omega who saw the code before 7pm, hence no effect on whether box B is full. It does not reason “but Omega knows I will change my code”. If it reasoned that way it would be TDT, not CDT, and would one-box to begin with.
      - rwallace 23 Aug 2009 23:06 UTC
        0 points
        0
        Parent
        Actually I will add another comment because I can now articulate where the ambiguity comes in: how you add self modification to CDT (which doesn’t have it in the usual form); I’ve been assuming the original algorithm doesn’t try to micromanage the new algorithm’s decisions (which strikes me as the sensible way, not least because it gives better results here); you’ve been assuming it does (which I suppose you could argue, is more true to the spirit of the original CDT).
      - rwallace 23 Aug 2009 22:21 UTC
        0 points
        0
        Parent
        I still disagree, but I agree that we have hit the limits of discussion in this comment thread; fundamentally this needs to be analyzed in a more precise language than English. We can revisit it if either of us ever gets to actually programming anything like this.
- timtyler 20 Aug 2009 7:07 UTC
  −1 points
  0
  Parent
  
  By hypothesis, Omega on examining your code at 6:59, knows that you will self-modify at 7:00 and two-box thereafter.
  
  By what hypothesis? That is not how the proposed Disposition-Based Decision Theory says it works. It claims to result in agents who have the disposition to one-box.
  - rwallace 20 Aug 2009 22:48 UTC
    2 points
    0
    Parent
    Sure. This sub thread was about plain CDT, and how it self-modifies into some form of DBDT/TDT once it figures out the benefits of doing so—and given the hypothesis of an omniscient Omega, then Omega will know that this will occur.
    - timtyler 21 Aug 2009 5:43 UTC
      −1 points
      0
      Parent
      In that case, what I think you meant to say was:
      
      Omega on examining your code at 6:59, knows that you will self-modify at 7:00 and ONE-box thereafter.
      - rwallace 21 Aug 2009 7:44 UTC
        2 points
        0
        Parent
        Doh! Thanks for the correction, editing comment.