I just don’t see pre-rationality being much tied to whether you in fact had a rational creator. The point is, as you say, to consider the info in the way you were created. I certainly do think one should also consider the info in the preferences we were given, as well as the beliefs, but I just don’t see this implying that we should have the common preferences. If you could prove something similar to what I proved about common priors for common utility functions, that would be very interesting.
In your paper, you defined pre-rationality as having the same beliefs as a hypothetical pre-agent who learns about your prior assignment. Let’s extend this definition to values. You are extended-pre-rational if each counterfactual version of you (e.g., where you inherited a different assortment of genes from your parents) has the same beliefs and values as a pre-agent who learns about his prior and utility function assignments. Since values (i.e., utility functions) don’t change or update upon learning new info, all counterfactual versions of you must have the same utility functions, if they are extended-pre-rational.
Does that make sense, or do I need to formalize the argument?
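Here is a rough sketch of that step, with notation I'm introducing here rather than taking from the paper (A, q, a_w, p_w, u_w, and u are all mine): let A be the random variable describing which prior and utility function each counterfactual version of you is assigned, let q be the pre-agent's pre-prior, and let a_w be the actual assignment in counterfactual version w. Pre-rationality for beliefs, as described above, says p_w(E) = q(E \mid A = a_w) for all events E. The extended condition would add that w's utility function u_w must equal whatever utility function the pre-agent holds after learning A = a_w. But conditioning on new information changes only beliefs, never the utility function, so the pre-agent's utility after learning is still its original u. Hence u_w = u for every counterfactual version w, i.e., all of them share a common utility function.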
You can define a similar “pre” condition, but it is far less clear why one should satisfy such a condition. Beliefs are about the world out there, so it seems clearer that you don’t want your beliefs to change when you change but the world out there doesn’t change. Values are about you, so it seems reasonable for your values to change even when the world out there doesn’t change.
Are beliefs just about the world out there, or are they also partly about you? Certainly, as a matter of fact, people’s beliefs do change when they change but the outside world doesn’t change. According to standard normative rationality (i.e., expected utility maximization) that’s irrational, but under EU maximization it’s also irrational to change one’s values, since that causes inconsistencies between decisions made at different points in time.
I think there is a line between the objective and subjective parts of preference (or as you put it, what’s about you and what’s about the world), but perhaps it should be drawn somewhere other than between the prior and the utility function. But right now that’s little more than a vague idea.
Well, among economists it is accepted as rational for your preferences to change with context, including time. As you probably know, there are EU equivalence theorems showing that for any (p0, U0) there are many other pairs (p1, U1), (p2, U2), etc. that produce all the same choices. I break this symmetry by saying that p is about the world while U is about you. The patterns of choice that are explained by changes in you should go in U, and the patterns of choice that are explained by changes in what you believe about the world should go in p.
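For concreteness, here is the standard derivation behind the equivalence claim above, in the finite-state case (a sketch; the positive weighting function g is something I'm introducing for illustration, not notation from any particular paper): an act a gets expected utility EU_{p_0,U_0}(a) = \sum_s p_0(s)\,U_0(a,s). Pick any g(s) > 0 and define p_1(s) = p_0(s)\,g(s)/Z with Z = \sum_{s'} p_0(s')\,g(s'), and U_1(a,s) = Z\,U_0(a,s)/g(s). Then EU_{p_1,U_1}(a) = \sum_s \frac{p_0(s)\,g(s)}{Z} \cdot \frac{Z\,U_0(a,s)}{g(s)} = EU_{p_0,U_0}(a), so (p_1, U_1) ranks every act exactly as (p_0, U_0) does, and each choice of g gives a different belief/utility pair that produces the same choices.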
That's surprising for me to hear, and seems to contradict the information given at http://en.wikipedia.org/wiki/Time_inconsistency#In_behavioral_economics
Exponential discounting and, more generally, time-consistent preferences are often assumed in rational choice theory, since they imply that all of a decision-maker's selves will agree with the choices made by each self.
Later on it says:
This would imply disagreement by people’s different selves on decisions made and a rejection of the time consistency aspect of rational choice theory.
But I thought this rejection means rejection as a positive/descriptive theory of how humans actually behave, not as a normative theory of what is rational. Are you saying that economists no longer consider time consistency to be normative?
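For reference, the algebra behind the time-consistency claim in the quoted passage (a sketch; the numbers in the hyperbolic example are just illustrative): under exponential discounting, a reward r delivered at time t is worth r\,\delta^{t-s} to the self at time s \le t. Comparing r_1 at t_1 against r_2 at a later t_2, we prefer r_1 iff r_1\,\delta^{t_1-s} > r_2\,\delta^{t_2-s}, i.e., iff r_1 > r_2\,\delta^{t_2-t_1}, which does not depend on s, so every earlier self endorses the same choice. Under hyperbolic discounting D(\tau) = 1/(1+k\tau) the ranking can flip: with k = 1, r_1 = 10 at t_1 = 1 and r_2 = 16 at t_2 = 2, the self at s = 1 values them at 10 vs. 8 and takes the sooner reward, while the self at s = 0 values them at 5 vs. 16/3 \approx 5.33 and prefers the later one.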
ETA: Whoever is voting Robin down, why are you doing that?
Conflicts are unfortunate, but hardly irrational. If it is not irrational for two different people at the same time to have different preferences, it is not irrational for the same person at different times to have different preferences.
I have to admit, I always thought of time consistency as a standard part of individual rationality, and didn't consider that anyone might take the position that you're taking. I'll have to think about this some more. In the meantime, what about my other question: how to actually become pre-rational? Have you looked at this comment yet?
If people could cheaply bind their future selves, and didn’t directly prefer not to do so, it would be irrational of them to let their future selves have different preferences.
If you owned any slave and could cheaply do so, you’d want to mold it to share exactly your preferences. But should you treat your future selves as your slaves?
Upon further reflection, I think altruism towards one’s future selves can’t justify having different preferences, because there should be a set of compromise preferences such that both your current self and your future selves are better off if you bind yourself (both current and future) to that set.
The logical structure of this argument is flawed. Here’s another argument that shares the same structure, but is clearly wrong:
If you owned any slave and could cheaply do so, you’d want to ensure it doesn’t die of neglect. But should you treat your future selves as your slaves?
Here’s another version that makes more sense:
If you had an opportunity to mold a friend to share exactly your preferences, and could do so cheaply, you might still not want to do so, and wouldn’t be considered irrational for it. So why should you be considered irrational for not molding your future selves to share exactly your preferences?
One answer here might be that changing your friend's preferences is wrong because it hurts him according to his current preferences, while doing the same to your future selves isn't wrong because they don't exist yet. But I think Robin's moral philosophy says that we should respect the preferences of nonexistent people, so his position seems consistent with that.
This seems like the well-worn discussion on whether rational agents should be expected to change their preferences. Here’s Omohundro on the topic:
“Their utility function will be precious to these systems. It encapsulates their values and any changes to it would be disastrous to them. If a malicious external agent were able to make modifications, their future selves would forevermore act in ways contrary to their current values. This could be a fate worse than death! Imagine a book loving agent whose utility function was changed by an arsonist to cause the agent to enjoy burning books. Its future self not only wouldn’t work to collect and preserve books, but would actively go about destroying them. This kind of outcome has such a negative utility that systems will go to great lengths to protect their utility functions.”
http://selfawaresystems.files.wordpress.com/2008/01/ai_drives_final.pdf
He goes on to discuss the issue in detail and lists some exceptional cases.