Is there any chance that we (a) CAN’T restrict AI to be Friendly per se, but (b) (conditional on this impossibility) CAN restrict it enough to keep it from blowing up in our faces? If Friendly AI is in fact not possible, then a first-generation AI may recognize this fact and not want to build a successor that would destroy the first-generation AI in an act of unfriendliness.
It seems to me like the worst case would be that Friendly AI is in fact possible... but that we aren’t the first to discover how: the first AI works it out and builds successors Friendly to itself rather than to us, in which case AI would happily perpetuate itself. But what are the best- and worst-case scenarios conditional on Friendly AI being IMpossible?
Has this been addressed before? As a disclaimer, I haven’t thought much about this and I suspect that I’m dressing up the problem in a way that sounds different to me only because I don’t fully understand the implications.
First, define “friendly” in enough detail that I know that it’s different from “will not blow up in our faces”.
Ooh, good catch! wheninrome15 may need to define “will not blow up in our faces” in more detail as well.
Such an eventuality would seem to require that (a) human beings are not computable or (b) human beings are not Friendly, since if some human were both computable and Friendly, an emulation of that human would itself be a Friendly AI.
In the latter case, if nothing else, there is [individual]-Friendliness to consider.
I think human history has demonstrated that (b) is certainly true… sometimes I am surprised we are still here.
The argument from (b)* is one of the stronger ones I’ve heard against FAI.
* Not to be confused with the argument from /b/.
Incidentally, /b/ might be good evidence for (b). It’s a rather unsettling demonstration of what people do when anonymity has removed most of the incentive for signaling.
I find chans’ lack of signaling highly intellectually refreshing. /b/ is not typical: because of its ridiculously high traffic, only meme-infested threads that you can reply to in five seconds survive there. Normal boards have far better discussion quality.