For what it’s worth, Eliezer’s answer to your second question is here:
Is that true? Why can’t the wish point at what it wants (e.g., the wishes of a particular human X), rather than spelling it out in detail?
The first problem is that the wish would have to be extremely good at pointing.
This sounds silly, but what I mean is that humans are COMPLICATED. “Pointing” at a human and telling an AI to deduce things about it will turn up HUGE swathes of data that you must already have prepared it to ignore or pay attention to. To give a classic simple example: smiles are a sign of happiness, but we do not want to tile the universe in smiley faces, or create a highly contagious artificial virus that constricts your face into a rictus.
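To make that proxy failure concrete, here is a minimal toy sketch. Every name and number in it is hypothetical, made up for illustration rather than taken from anyone’s actual design; the point is what happens when the wish is “pointed” at an observable correlate instead of the thing we care about:

```python
# Toy illustration of proxy misspecification: the optimizer is pointed
# at an observable correlate (smiling) rather than the latent variable
# the wisher actually cares about (happiness).

from dataclasses import dataclass

@dataclass
class WorldState:
    smiles: int        # observable proxy the wish was pointed at
    happiness: float   # latent variable the wisher actually cares about

def proxy_reward(state: WorldState) -> float:
    """What the genie was told to optimize: visible signs of happiness."""
    return float(state.smiles)

def true_value(state: WorldState) -> float:
    """What the wisher actually wanted."""
    return state.happiness

# Two candidate plans; the pathological one dominates on the proxy.
plans = {
    "help people flourish":       WorldState(smiles=10**3, happiness=1000.0),
    "tile universe with smileys": WorldState(smiles=10**30, happiness=0.0),
}

best = max(plans, key=lambda name: proxy_reward(plans[name]))
print(best)                     # -> tile universe with smileys
print(true_value(plans[best]))  # -> 0.0: proxy maximized, value destroyed
```

The toy numbers don’t matter; the shape of the failure does. Any feature that falls short of the full target leaves this kind of gap for a strong optimizer to exploit.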
Second: assuming that works, it works primarily for one person, which gives that person a lot more power than I think most people want to give any one person. But if we could guarantee an AI would fulfill the values of ONE person rather than of multiple people, and someone else was developing an AI that wasn’t guaranteed to fulfill any values, I’d probably take it.
To spell out some of the complications: does the genie only respond to verbal commands? What if the human is temporarily angry at someone and some internal part of their brain wishes them harm? The genie needs to know not to act on this, so it must have some kind of requirement for reflective equilibrium.
Suppose the human is duped into pursuing some unwise course of action. The genie needs to reject those new wishes, yet the human should still be able to have their morality evolve over time.
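As a very rough sketch of what that filtering requirement amounts to (the names and the two “endorsement” oracles below are hypothetical stand-ins, and the oracles are exactly the hard part), the genie acts on a wish only if it survives the wisher’s own calm reflection:

```python
# Toy sketch of a reflective-equilibrium filter: a wish is acted on only
# if the wisher endorses it both in the moment and on calm reflection.
# Both endorsement oracles are placeholders for the genuinely hard part.

from typing import Callable

def should_act(wish: str,
               endorsed_now: Callable[[str], bool],
               endorsed_on_reflection: Callable[[str], bool]) -> bool:
    return endorsed_now(wish) and endorsed_on_reflection(wish)

# A flash of anger endorses the wish now, but it fails on reflection,
# so the genie refuses. A wish produced by manipulation fails the same way.
print(should_act("hurt the person I'm angry at",
                 endorsed_now=lambda w: True,
                 endorsed_on_reflection=lambda w: False))  # -> False

# Genuine moral growth endorses the wish both ways, so it goes through;
# that is how evolving values stay possible under the same filter.
print(should_act("give more to charity than past-me would have",
                 endorsed_now=lambda w: True,
                 endorsed_on_reflection=lambda w: True))   # -> True
```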
So you still need a complete CV Extrapolator. But maybe that’s what you had in mind by pointing at the wishes of a particular human?
I think that Obedient AI requires less of the fragility-of-values type of machinery.
I don’t see why a genie can’t kill you just as hard by missing one dimension of what it would mean to satisfy your wish.
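A tiny numerical sketch of that point (the dimensions, the square-root utility, and the budget are all made up for illustration): drop a single dimension of value from the objective, and the optimal plan trades that dimension away entirely:

```python
# Toy sketch of value fragility: omit one dimension of value from the
# objective and the optimal allocation sends that dimension to zero.

import numpy as np

n_dims = 5     # dimensions of value (freedom, joy, novelty, ...)
budget = 1.0   # total resources the genie can allocate

def true_value(alloc: np.ndarray) -> float:
    # The wisher cares about every dimension, with diminishing returns.
    return float(np.sum(np.sqrt(alloc)))

# The wish forgot the last dimension. For a concave utility under a
# fixed budget, the optimum is to split evenly among the remembered
# dimensions and give the forgotten one nothing.
alloc_specified = np.zeros(n_dims)
alloc_specified[:-1] = budget / (n_dims - 1)

alloc_balanced = np.full(n_dims, budget / n_dims)

print(alloc_specified)              # last entry 0.0: that value is gone
print(true_value(alloc_specified))  # ~2.000
print(true_value(alloc_balanced))   # ~2.236: what the wisher wanted
```

Missing one dimension doesn’t just cost you a little of that dimension; under strong optimization pressure it tends to cost you all of it.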
I’m not talking about a naive obedient AI here. I’m talking about a much less meta FAI that does not do analysis of metaethics or CEV, and does not take incredibly vague, subtle wishes. (Atlantis in HPMOR may be an example of a very weak, rather irrational, poorly safeguarded Obedient AI with a very, very strange command set.)