A selection method could be based on physical measurement of a new intelligence’s net energy demands, and therefore of its sustainability as part of the broader ecosystem of intelligences. New intelligences should not be able to draw energy at a ratio of energy density to intelligence density larger than that of their biological counterparts, and they should enter the ecosystem while maintaining the stability of the existing network. The attractive feature of this approach is that maintaining, or even broadening, the ecosystem network is presumably consistent with what has been evolutionarily tested over several million years, so it should be relatively robust. Let’s call it SuperSustainableIntelligence?
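As a rough sketch of how such a criterion might be operationalised (all names, units, and numbers below are hypothetical illustrations, not part of the proposal above), an admission test could compare a candidate’s energy draw per unit of measured intelligence against a biological baseline; the network-stability condition is not modelled here:

```python
from dataclasses import dataclass

# Hypothetical sketch of a "SuperSustainableIntelligence" admission test:
# a new intelligence is admitted to the ecosystem only if its ratio of
# energy demand to measured intelligence does not exceed the biological
# baseline. All measures and numbers are placeholders.

@dataclass
class Intelligence:
    power_draw_watts: float        # net energy demand
    intelligence_score: float      # any agreed-upon capability measure

def energy_per_intelligence(agent: Intelligence) -> float:
    """Energy demand normalised by measured intelligence."""
    return agent.power_draw_watts / agent.intelligence_score

# Baseline taken from a biological counterpart, e.g. a human brain (~20 W)
# scored at 1.0 on the same intelligence scale; illustrative only.
HUMAN_BASELINE = Intelligence(power_draw_watts=20.0, intelligence_score=1.0)

def admissible(candidate: Intelligence,
               baseline: Intelligence = HUMAN_BASELINE) -> bool:
    """Admit the candidate only if it is no less energy-efficient per unit
    of intelligence than the biological baseline."""
    return energy_per_intelligence(candidate) <= energy_per_intelligence(baseline)

# Example: a data-centre-scale system scoring 100x the baseline.
datacenter_ai = Intelligence(power_draw_watts=5_000_000.0, intelligence_score=100.0)
print(admissible(datacenter_ai))  # False: 50,000 W per unit vs 20 W per unit
```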
That’s pretty cool. Could you explain to me how it does not cause us to kill people who have expansive wants, in order to reduce the progress toward entropy which they cause?
I guess in your framework the goal of Superintelligence is to “Postpone the Heat Death of the Universe”, to paraphrase an old play?
I think it might drive toward killing those who have expensive wants and who also do not occupy some special role in the network. Maybe a powerful individual who is extremely wasteful and who is actively causing ecosystem collapse by breaking the network should be killed to ensure the whole civilisation can survive.
I think the basic desire of a Superintelligence would be identity and maintaining that identity. In this sense, to “Postpone the Heat Death of the Universe”, or even to reverse it, would definitely be its ultimate goal. Perhaps it would even want to become the universe.
(Sorry for the long delay in replying; I don’t get notifications.)
Bostrom’s philosophical outlook shows. He’s defined the four categories to be mutually exclusive, and with the obvious fifth case they’re exhaustive, too:
1. Select motivations directly. (e.g. Asimov’s 3 laws)
2. Select motivations indirectly. (e.g. CEV)
3. Don’t select motivations, but use ones believed to be friendly. (e.g. Augment a nice person.)
4. Don’t select motivations, and use ones not believed to be friendly. (i.e. Constrain them with domesticity constraints.)
5. (Combinations of 1-4.)
In one sense, then, there aren’t other general motivation selection methods. But in a more useful sense, we might be able to divide the conceptual space into different categories from the ones Bostrom used, and the resulting categories could serve as heuristics that jump-start the development of new ideas.
Um, I should probably get more concrete and try to divide it differently. The following example alternative categories aren’t promised to be the kind that will effectively ripen your heuristics.
1. Research how human values are developed as a biological and cognitive process, and simulate that in the AI whether or not we understand what will result. (i.e. Neuromorphic AI, the kind Bostrom fears most)
2. Research how human values are developed as a social and dialectical process, and simulate that in the AI whether or not we understand what will result. (e.g. Rawls’s Genie)
3. Directly specify a single theory of partial human value, one important part that we can get right, and sacrifice our remaining values to guarantee it; or indirectly specify that the AI should figure out what single principle we most value and ensure that it is done. (e.g. Zookeeper)
4. Directly specify a combination of many different ideas about human values rather than trying to get the one theory right; or indirectly specify that the AI should do the same thing. (e.g. “Plato’s Republic”)
The thought was to first divide the methods by whether we program the means or the ends, roughly. Second, I subdivided those by whether we program the AI to find a unified or a composite solution, roughly. Anyhow, there may be other ways of categorizing this area of thought that more neatly carve it up at its joints.
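To make the unified/composite half of that division more concrete, here is a toy sketch (my own illustration; the value theories, weights, and names are invented for the example): a unified specification commits to a single theory of value, while a composite specification aggregates several partial theories, e.g. as a weighted sum:

```python
from typing import Callable, Dict

# Toy illustration of the unified vs. composite distinction (not from the
# original discussion): "unified" commits to one theory of value, while
# "composite" mixes several partial theories with weights.

Outcome = Dict[str, float]                 # crude description of a world state
ValueTheory = Callable[[Outcome], float]   # scores an outcome

def hedonic_value(o: Outcome) -> float:
    return o.get("pleasure", 0.0) - o.get("suffering", 0.0)

def preference_value(o: Outcome) -> float:
    return o.get("preferences_satisfied", 0.0)

def autonomy_value(o: Outcome) -> float:
    return o.get("choices_available", 0.0)

# Unified specification: commit to a single theory (roughly the "Zookeeper"
# style of guaranteeing one important part of value).
unified: ValueTheory = hedonic_value

# Composite specification: a weighted mix of candidate theories, hedging
# against any single theory being wrong.
def composite(weights: Dict[ValueTheory, float]) -> ValueTheory:
    def score(o: Outcome) -> float:
        return sum(w * theory(o) for theory, w in weights.items())
    return score

mixed = composite({hedonic_value: 0.4, preference_value: 0.4, autonomy_value: 0.2})

outcome = {"pleasure": 5.0, "suffering": 1.0,
           "preferences_satisfied": 3.0, "choices_available": 2.0}
print(unified(outcome), mixed(outcome))    # 4.0 vs 0.4*4 + 0.4*3 + 0.2*2 = 3.2
```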
Another approach might be to dump in a whole bunch of data and hope that the simplest model that fits the data is a good model of human values (this is like Paul Christiano’s hack to attempt to specify a whole brain emulation as part of an indirect normativity proposal if we haven’t achieved whole brain emulation capability yet: http://ordinaryideas.wordpress.com/2012/04/21/indirect-normativity-write-up/). There might be other sets of data that could be used in this way, e.g. run a massive survey on philosophical problems, record a bunch of people’s brains while they watch stories play out on television, or dump in DNA and hope it encodes stuff that points to brain regions relevant to morality. (I don’t trust this method, though.)
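One way to make “the simplest model that fits the data” concrete (this is only an illustrative sketch under standard model-selection assumptions, not Christiano’s actual construction) is to fit models of increasing complexity to stand-in “value judgment” data and keep the one with the best complexity-penalised score, e.g. BIC:

```python
import numpy as np

# Sketch of "pick the simplest model that fits the data": fit polynomial
# models of increasing complexity to synthetic "value judgment" data and
# select by BIC, which trades goodness of fit against parameter count.

rng = np.random.default_rng(0)
x = np.linspace(-1, 1, 200)                             # stand-in for situations
y = 2.0 * x - 0.5 * x**2 + rng.normal(0, 0.1, x.size)   # noisy "judgments"

def bic(degree: int) -> float:
    """BIC for a polynomial fit of the given degree (Gaussian errors assumed)."""
    coeffs = np.polyfit(x, y, degree)
    residuals = y - np.polyval(coeffs, x)
    n, k = x.size, degree + 1
    return n * np.log(np.mean(residuals**2)) + k * np.log(n)

scores = {d: bic(d) for d in range(6)}
best = min(scores, key=scores.get)
print(best)   # with this data, a low-degree model should win
```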
Can you think of more motivation selection methods for this list?