Yes, but what about other AIs that might be created, maybe without liberty as a top goal—it would need to act to prevent them from being built! It’s unlikely that “destroy itself” is the best option it can find...
Except that acting to prevent other AIs from being built would also encroach on human liberty, and probably in a very major way if it were to be effective! The AI might conclude from this that liberty is a lost cause in the long run, but that it is still better to have a few extra years of liberty (until the next AI gets built) than to end it right now (through its own powerful actions).
Other provocative questions: how much is liberty really a goal in human values (when taking the CEV for humanity as a whole, not just liberal intellectuals)? How much is it a terminal goal, rather than an instrumental goal? Concretely, would humans actually care about being ruled over by a tyrant, as long as it was a good tyrant? (Many people are attracted to the idea of an all-powerful deity, for instance, and many societies have had monarchs who were worshipped as gods.) Aren’t mechanisms like democracy, separation of powers, etc. mostly defence mechanisms against a bad tyrant? Why shouldn’t a powerful “good” AI just dispense with them?
A certain impression of freedom is valued by humans, but we don’t seem to want total freedom as a terminal goal.
Well of course we don’t. Total freedom is an incoherent goal: the only way to ensure total future freedom of action is to make sure nothing ever happens, thus maximizing the number of available futures without ever actually choosing one.
As far as I’ve been able to reason out, the more realistic human conception of freedom is: “I want to avoid having other agenty things optimize me (for their preferences (unilaterally)).” The last part is there because there are mixed opinions on whether you’ve given up your ethical freedom if an agenty thing optimizes you for your preferences (as might happen in ideal situations, such as dealing with an FAI handing out transhuman candy), or whether you’ve given up your ethical freedom if you bind yourself to implement someone else’s preferences mixed-in with your own (for instance, by getting married).
That doesn’t make sense—why would the status quo, whatever it is, always maximize the number of available futures? Choosing a future does not restrict you: it closes some avenues but also opens others.
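A toy sketch may make this disagreement concrete. The state graph below is entirely made up; the point is only that counting reachable futures can come out either way, so whether “doing nothing” maximizes available futures depends on the structure of the world rather than following automatically.

```python
# Toy illustration: count how many distinct states are reachable within a
# fixed horizon after "doing nothing" vs. after acting. The graph is made up;
# the point is only that acting can open more futures than staying put.
from collections import deque

GRAPH = {                          # hypothetical state -> successor states
    "start": ["start", "door"],    # "start" -> "start" is the do-nothing loop
    "door":  ["hall", "garden", "start"],
    "hall":  ["library", "kitchen"],
    "garden": ["pond"],
    "library": [], "kitchen": [], "pond": [],
}

def reachable(state, horizon):
    """Return the set of states reachable from `state` within `horizon` steps."""
    seen, frontier = {state}, deque([(state, 0)])
    while frontier:
        s, d = frontier.popleft()
        if d == horizon:
            continue
        for nxt in GRAPH[s]:
            if nxt not in seen:
                seen.add(nxt)
                frontier.append((nxt, d + 1))
    return seen

# "Do nothing" keeps us at "start"; "act" moves us to "door".
print(len(reachable("start", 2)))  # futures open after staying put: 4
print(len(reachable("door", 2)))   # futures open after acting: 7
```

In this made-up graph the second count is larger, which is exactly the possibility the reply above is pointing at.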
“Total freedom” is a silly concept, of course, but it’s just as silly as “Total happiness”.
Total happiness seems to make more plausible sense than total freedom.
Not sure how you determine degrees of plausibility :-/
The expression “total happiness” (other than in contexts of the “it’s like, dude, I was so totally chill and happy” kind) makes no more sense to me than “total freedom”.
Assume B chooses without coercion, but assume A always knows what B will choose and can set up various facts in the world to determine B’s choice. Is B free?
I think there is insufficient information to answer the question as asked.
If I offer you the choice of a box with $5 in it, or a box with $500 000 in it, and I know that you are close enough to a rational utility-maximiser that you will take the $500 000, then I know what you will choose and I have set up various facts in the world to determine your choice. Yet it does not seem on the face of it as if you are not free.
On the other hand, if you are trying to decide between being a plumber or a blogger, and I use superhuman AI powers to subtly intervene in your environment to push you into one or the other without your knowledge, then I have set up various facts in the world to determine your choice, and it does seem like I am impinging on your freedom.
So the answer seems to depend at least on the degree of transparency between A and B in their transactions. Many other factors are almost certainly relevant, but that issue (probably among many) needs to be made clear before the question has a simple answer.
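For what it’s worth, the transparency heuristic above can be written down as a toy predicate. Everything below is a hypothetical sketch, not a proposed definition of liberty: the fields and the rule simply restate the two cases from the comment in code.

```python
from dataclasses import dataclass

@dataclass
class Interaction:
    """Hypothetical fields summarizing how A influenced B's choice."""
    a_predicted_choice: bool    # A knew what B would choose
    a_arranged_facts: bool      # A set up facts in the world to steer the choice
    b_aware_of_influence: bool  # the influence was transparent to B

def seems_freedom_preserving(x: Interaction) -> bool:
    """Toy encoding of the transparency heuristic from the comment above:
    prediction alone is fine, and arranging facts is suspect only when it
    is hidden from B."""
    if not x.a_arranged_facts:
        return True
    return x.b_aware_of_influence

# The two cases from the comment:
print(seems_freedom_preserving(Interaction(True, True, True)))   # open offer of $5 vs $500 000 -> True
print(seems_freedom_preserving(Interaction(True, True, False)))  # covert career-steering -> False
```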
Can you cash out the difference between those two cases in sufficient detail that we can use it to safely define what liberty means?
I said earlier in this thread that we can’t do this and that it is a hard problem, but also that I think it’s a sub-problem of strong AI and we won’t have strong AI until long after we’ve solved this problem.
I know that Word of Eliezer is that disciples won’t find it productive to read philosophy, but what you are talking about here has been discussed by analytic philosophers and computer scientists as “the frame problem” since the eighties, and it might be worth a read for you. Fodor argued that there is a class of “informationally unencapsulated” problems where you cannot specify in advance what information is and is not relevant to solving the problem; hence really solving them, as opposed to coming up with a semi-reliable heuristic, is an incredibly difficult problem for AI. Defining liberty or identifying it in the wild seems to be an informationally unencapsulated problem in that sense, and hence a very hard one, but one which AI has to solve before it can tackle the problems humans tackle.
If I recall correctly Fodor argued in Modules, Frames, Fridgeons, Sleeping Dogs, and the Music of the Spheres that this problem was in fact the heart of the AI problem, but that proposition was loudly raspberried in the literature by computer scientists. You can make up your own mind about that one.
Here’s a link to the Stanford Encyclopedia of Philosophy page on the subject.
It depends on how general or narrow you make the problem. Compare: is classical decision theory the heart of the AI problem? If you interpret this broadly, then yes; but the link from, say, car navigation to classical decision theory is tenuous when you’re working on the first problem. The same thing for the frame problem.
You mean the frame problem that I talked about here? http://lesswrong.com/lw/gyt/thoughts_on_the_frame_problem_and_moral_symbol/
The issue can be talked about in terms of the frame problem, but I’m not sure that’s useful. In the classical frame problem, we have a much clearer idea of what we want; the problem is specifying enough so that the AI does too (i.e. so that the token “loaded” corresponds to the gun being loaded). This is quite closely related to symbol grounding, in a way.
When dealing with moral problems, we have the problem that we haven’t properly defined the terms to ourselves. Across the span of possible futures, the term “loaded gun” is likely much more sharply defined than “living human being”. And if it isn’t—well, then we have even more problems if all our terms start becoming slippery, even the ones with no moral connotations.
But in any case, saying the problem is akin to the frame problem… still doesn’t solve it, alas!
Note that the relevance issue has been successfully solved in any number of complex practical applications, such as self-driving vehicles, which are able to filter out gobs of irrelevant data, or the LHC code, which filters out even more. I suspect that the frame problem is not some general problem that needs to be resolved for AI to work, but just one of many technical issues, just as the “computer scientists” suggest. On the other hand, it is likely to be a real problem for FAI design, where relying on heuristics that provide, say, six-sigma certainty just isn’t good enough.
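A back-of-the-envelope calculation, with made-up numbers, of why heuristic-grade reliability that is fine for a car or a collider trigger may not be good enough for FAI: even a tiny per-decision chance of discarding a morally relevant fact compounds over the enormous number of decisions a powerful optimizer would make.

```python
# Illustration with assumed numbers: a relevance heuristic that almost never
# drops something important is fine for a car or a collider trigger, but over
# enough decisions the chance of at least one miss approaches certainty.
p_miss = 3.4e-6            # "six sigma"-style per-decision error rate (assumed)
for n_decisions in (1e3, 1e6, 1e9):
    p_at_least_one = 1 - (1 - p_miss) ** n_decisions
    print(f"{int(n_decisions):>13,} decisions -> P(at least one miss) = {p_at_least_one:.4f}")
```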
I think that the frame problem is distinct from the problem of defining and calculating
things like liberty, which seem like obvious qualities to specify in an optimal world we are building an AI to search for
mostly because attempting to define liberty objectively leads us to the discussion of free will, the latter being an illusion due to the human failure to introspect deeply enough.
I tend to think that you don’t need to adopt any particular position on free will to observe that people in North Korea lack freedom from government intervention in their lives, access to communication and information, a genuine plurality of viable life choices and other objectively identifiable things humans value. We could agree for the sake of argument that “free will is an illusion” (for some definitions of free will and illusion) yet still think that people in New Zealand have more liberty than people in North Korea.
I think that you are basically right that the frame problem is like the problem of building a longer bridge, or a faster car, in that you are never going to solve the entire class of problems at a stroke so that you can make infinitely long bridges or infinitely fast cars, but that you can make meaningful incremental progress over time. I’ve said from the start that capturing the human ability to make philosophical judgments about liberty is a hard problem but I don’t think it is an impossible one—just a lot easier than creating a program that does that and solves all the other problems of strong AI at once.
In the same way that it turns out to be much easier to make a self-driving car than a strong AI, I think we’ll have useful natural-language parsing of terms like “liberty” before we have strong AI.
Well, yes, it is hard to argue about NK vs the West. But let’s try to control for the “non-liberty” confounders, such as income, wealth, social status, etc. Say we take some upper-middle-class person from Iran, Russia or China. It is quite likely that, when comparing their life with that of a Westerner of similar means, they would not immediately state that the Western person has more “objectively identifiable things humans value”. Obviously the sets of these valuable things are different, and the priorities different people assign to them would be different, but I am not sure that there is a universal measure everyone would agree upon as “more liberty”.
A universal measure for anything is a big demand. Mostly we get by with some sort of somewhat-fuzzy “reasonable person” standard, which we obviously can’t yet fully explicate in neurological terms either, but which is much more achievable.
Liberty isn’t a one-dimensional quality either, since for example you might have a country with little real freedom of the press but lots of freedom to own guns, or vice versa.
What you would have to do to develop a measure with significant intersubjective validity is to ask a whole bunch of relevantly educated people what things they consider important freedoms and what incentives they would need to be offered to give them up, to figure out how they weight the various freedoms. This is quite do-able, and in fact we do very similar things when we do QALY analysis of medical interventions to find out how much people value a year of life without sight compared to a year of life with sight (or whatever).
Fundamentally it’s not different to figuring out people’s utility functions, except we are restricting the domain of questioning to liberty issues.
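A minimal sketch of that elicitation idea, using entirely hypothetical survey numbers and country scores (nothing here is real data): stated willingness-to-accept figures are averaged and normalized into weights, which then give a crude weighted liberty index along the lines of the QALY analogy.

```python
# Hypothetical willingness-to-accept figures (dollars/year to give up each
# freedom), one list entry per respondent. All numbers are invented.
survey = {
    "freedom_of_press":    [40_000, 55_000, 30_000],
    "freedom_of_movement": [60_000, 80_000, 70_000],
    "gun_ownership":       [5_000, 20_000, 1_000],
}

def elicit_weights(responses):
    """Average each freedom's stated value and normalize so the weights sum to 1."""
    means = {k: sum(v) / len(v) for k, v in responses.items()}
    total = sum(means.values())
    return {k: m / total for k, m in means.items()}

def liberty_index(weights, scores):
    """Weighted sum of per-dimension scores, each in [0, 1]."""
    return sum(weights[k] * scores[k] for k in weights)

weights = elicit_weights(survey)

# Hypothetical country profiles: each dimension scored 0 (none) to 1 (full).
country_a = {"freedom_of_press": 0.9, "freedom_of_movement": 0.95, "gun_ownership": 0.2}
country_b = {"freedom_of_press": 0.1, "freedom_of_movement": 0.4,  "gun_ownership": 0.8}

print(weights)
print("A:", round(liberty_index(weights, country_a), 3))
print("B:", round(liberty_index(weights, country_b), 3))
```

The weighting step is where the multi-dimensionality mentioned above gets handled: a country strong on gun ownership but weak on press freedom can still score low if respondents weight press freedom more heavily.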
So, just checking before I answer: you’re claiming that no direct, gun-to-the-head coercion is employed, but Omega can always predict your actions and responses, and sets things up to ensure you will choose a specific thing.
Are you free, or are you in some sense “serving” Omega? I answer: The latter, very, very, very definitely.
If we take it out of abstract language, real people manipulate each other all the time, and we always condemn it as a violation of the ethical principle of free choice. Yes, sometimes there are principles higher than free choice, as with a parent who might say, “Do your homework or you get no dessert” (treat that sentence as a metasyntactic variable for whatever you think is appropriate parenting), but we still prefer, all else equal, that our choices and those of others be less manipulated rather than more.
Just because fraud and direct coercion are the usual standards for proving a violation of free choice in a court of law, for instance in order to invalidate a legal contract, does not mean that they are the be-all and end-all of the underlying ethics of free choice.
Then if Omega is superintelligent, it has a problem: every single decision it makes increases or decreases the probability of someone answering something or other, possibly by a large amount. It seems Omega cannot avoid being coercive, just because it’s so knowledgeable.
We don’t quite know that, and there’s also the matter of whether Omega is deliberately optimizing those people or they’re just reacting to Omega’s optimizing the inanimate world (which I would judge to be acceptable and, yes, unavoidable).
It is an interesting question, though, and illustrates the challenges with “liberty” as a concept in these circumstances.
Well yes. It’s also why many people have argued in favor of Really Powerful Optimizers just… doing absolutely nothing.
That I don’t see.