Nice. A while ago I also noticed that you can control any mathematical structure if it knows about you and you know about it (i.e. there is a logical dependence between you), which generalizes the notion of trade with other possible worlds, control of the past, etc. If that other mathematical structure is interpreted as an agent, it can be made to behave as you prefer, if in return you behave as it prefers. Thus, it's possible for us to have and realize preferences over mathematical structures, in particular by trading with them in this manner.
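To make the trade idea concrete, here is a minimal Python sketch of the standard mirror-cooperation setup in a one-shot prisoner's dilemma; the function name and the stand-in for "source code" are my own illustrative choices, not anything from the discussion:

```python
# Toy model of trade via logical dependence: an agent that can see the
# other agent's source cooperates exactly when the other side runs the
# same algorithm, so each copy's choice logically determines the other's.

def mirror_agent(own_source: str, other_source: str) -> str:
    """Return "C" (cooperate) iff the other agent is the same algorithm."""
    return "C" if other_source == own_source else "D"

SRC = "mirror_agent_v1"  # stand-in for the agent's actual source code

# Two logically dependent copies: choosing to cooperate *makes* the other
# cooperate, because both compute the same function of the same input.
assert mirror_agent(SRC, SRC) == "C"

# No logical dependence, no trade:
assert mirror_agent(SRC, "some_other_algorithm") == "D"
```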
At the same time, there are all sorts of weird limitations on what's possible to affect this way. For example, you can control something faster than light could carry a signal to it (logical control), but only using info that is already part of the logical dependence, which excludes info that only one side has. If you send away a perfect simulation of your mind on a spaceship, you can "control" what happens on the spaceship so long as neither of you receives observations from outside, as both computations will be identical. If some info from a year ago is sent to the spaceship, and both you and the simulation observe it (simultaneously), you remain synchronized, but now you have learned something new. This way, streams of observations can be sent in both directions, continuously updating both copies. These observations, being identical, are added to the logical dependence between you and the simulation, and so can be used in logical control. Thus, the whole state of knowledge is shared, and the conclusions of the whole algorithm of the mind can be used for control.
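A small deterministic sketch of the spaceship scenario (the hash-based update rule is an arbitrary stand-in for a mind's state transition):

```python
import hashlib

def step(state: bytes, observation: bytes) -> bytes:
    """Deterministic update: same state + same observation -> same next state."""
    return hashlib.sha256(state + observation).digest()

earth = ship = b"initial mind state"

# Observations delivered to both copies keep them synchronized, and so
# remain usable for logical control:
for obs in [b"news from a year ago", b"more shared news"]:
    earth = step(earth, obs)
    ship = step(ship, obs)
assert earth == ship  # the whole state of knowledge is shared

# A private observation breaks the synchrony, and with it the dependence:
earth = step(earth, b"something only Earth saw")
assert earth != ship
```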
On the other hand, if you know something above and beyond this shared knowledge (like recent observations), you can't use that knowledge, or any conclusions reached from it, in logical control. You can't update on non-shared knowledge and retain the ability to handle the logical dependence. This seems related to non-updating in counterfactual mugging: you need to exercise control over the other possible world, and so you can't update on the observation that is particular to your possible world and then use the whole algorithm that includes this update to control the other world. You can "update" if you can factor your state of knowledge into what's dependent on what, and what can be used for control of what, though.
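For reference, the expected-value arithmetic behind non-updating in counterfactual mugging, using the usual illustrative payoffs (a fair coin; pay $100 on tails; receive $10,000 on heads iff you would have paid on tails):

```python
p_heads = 0.5

def expected_value(pays_on_tails: bool) -> float:
    heads_payoff = 10_000 if pays_on_tails else 0  # reward in the other branch
    tails_payoff = -100 if pays_on_tails else 0    # cost in this branch
    return p_heads * heads_payoff + (1 - p_heads) * tails_payoff

print(expected_value(True))   # 4950.0
print(expected_value(False))  # 0.0
```

An agent that updates on seeing tails only sees the -100 and refuses, yet the paying policy is worth $4950 in expectation before the flip; controlling the heads branch requires not conditioning on the tails observation.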
Eliezer, does the formalism of Pearl's graphs allow this idea to be captured? So far, I'm not sure how much insight can be gained from studying it (and your TDT), so I'm leaving it until after I finish learning the basics of logic.
I think you could use a non-updated Pearl graph for your updateless decision theory, but the part where you (instead of updating) decide which computational processes are similar or dissimilar to you would, I think, be a logical problem, not the domain of causal graphs.
Not-updating is the same kind of simplified denotational behemoth as a GLUT. Much of the usefulness of probabilistic graphical models comes from the fact that they compress the probability distribution into smaller representations, and allow these distributions to be manipulated and specified in terms of the compact representations. Simply making many copies of the graphical model won't capture the structure of the problem; so instead of being updateless, the decision theory must either update what it can, or represent many partially dependent states of knowledge in a single structure, allowing it to extract decisions unaffected by the knowledge that doesn't belong to them.
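The compression point can be seen in a toy chain-structured Bayesian network, where the factored form needs O(n) parameters instead of the 2^n - 1 of an explicit joint table (the probabilities below are made-up values):

```python
from itertools import product

n = 10
p_x1 = 0.6                 # P(X1 = 1)
p_next = {0: 0.2, 1: 0.7}  # P(X_{k+1} = 1 | X_k), shared across the chain

def joint(assignment):
    """Recover P(x1, ..., xn) from the compact factored representation."""
    p = p_x1 if assignment[0] else 1 - p_x1
    for prev, cur in zip(assignment, assignment[1:]):
        q = p_next[prev]
        p *= q if cur else 1 - q
    return p

# A general chain needs 1 + 2*(n - 1) parameters; the explicit table 2**n - 1.
print(f"chain CPTs: {1 + 2 * (n - 1)} numbers vs. full table: {2**n - 1}")
assert abs(sum(joint(a) for a in product((0, 1), repeat=n)) - 1.0) < 1e-9
```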
I suspect that expectation maximization/probability won't play an important role in this structure, as the structure of graphical models seems to capture the same objects that logical dependence must (where do the causal graphs come from in the first place?), and so a structure that can work with logical (in)dependence may already contain the structure captured by probabilistic graphical models, subsuming the latter.