Start the AI in a sandbox universe, like the “game of life”. Give it a prior saying that universe is the only one that exists (no universal priors plz), and a utility function that tells it to spell out the answer to some formally specified question in some predefined spot within the universe. Run for many cycles, stop, inspect the answer.
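A minimal sketch of what this setup might look like (Python with NumPy); the initial pattern, step count, and answer-region coordinates are placeholders, not part of the proposal:

```python
import numpy as np

def life_step(grid):
    """One synchronous Game of Life update on a toroidal (wrapping) grid."""
    neighbors = sum(
        np.roll(np.roll(grid, dy, axis=0), dx, axis=1)
        for dy in (-1, 0, 1) for dx in (-1, 0, 1)
        if (dy, dx) != (0, 0)
    )
    return ((neighbors == 3) | ((grid == 1) & (neighbors == 2))).astype(np.uint8)

def run_sandbox(initial_grid, steps, answer_region):
    """Run the sealed universe for a fixed number of cycles, then read
    the cells in the predefined answer spot."""
    grid = initial_grid.copy()
    for _ in range(steps):
        grid = life_step(grid)
    r0, r1, c0, c1 = answer_region       # rectangle reserved for the answer
    return grid[r0:r1, c0:c1]            # inspect these cells after the run
```

The only interaction is choosing the initial grid and reading the returned cells afterwards; there is no input or output channel during the run.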
A prior saying that this is the only universe that exists isn’t very useful, since the AI will then treat everything as part of the sandbox universe. It may very well break out, while thinking it’s merely exploiting weird hidden properties of the game of life-verse. (Like the way we may exploit quantum mechanics without thinking that we’re breaking out of our universe.)
I have no idea how to encode a prior saying “the universe I observe is all that exists”, which is what you seem to assume. My proposed prior, which we do know how to encode, says “this mathematical structure is all that exists”, with an a priori zero chance of any weird properties.
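For concreteness, a sketch of such a prior, assuming the structure is a game of life universe with a fully known rule set and initial state (both are placeholders here):

```python
KNOWN_RULES = "B3/S23"                                     # Conway's Game of Life
KNOWN_INITIAL_STATE = frozenset({(0, 1), (1, 1), (2, 1)})  # placeholder pattern

def prior(hypothesis):
    """Probability 1 for the single specified structure, 0 for everything else."""
    if hypothesis == (KNOWN_RULES, KNOWN_INITIAL_STATE):
        return 1.0
    return 0.0   # weird hidden properties, outside worlds, etc. get zero a priori
```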
If the AI is only used to solve certain formally specified questions without any knowledge of an external world, then that sounds much more like a theorem-prover than a strong AI. How could this proposed AI be useful for any of the tasks we’d like an AGI to solve?
An AI living in a simulated universe can be just as intelligent as one living in the real world. You can’t ask it directly to feed African kids, but you have many other options; see the discussion at Asking Precise Questions.
It can be a very good theorem prover, sure. But without access to information about the world, it can’t answer questions like “what is the CEV of humanity like” or “what’s the best way I can make a lot of money” or “translate this book from English to Finnish so that a native speaker will consider it a good translation”. It’s narrow AI, even if it could be broad AI if it were given more information.
The questions you wanted to ask in that thread were a poly-time algorithm for SAT and short proofs of math theorems. For those, why do you need to instantiate an AI in a simulated universe (which allows it to potentially create what we’d consider negative utility within the simulated universe) instead of just running a (relatively simple, sure to lack consciousness) theorem prover?
Is it because you think that being “embodied” helps with ability to do math? Why? And does the reason carry through even if the AI has a prior that assigns probability 1 to a particular universe? (It seems plausible that having experience dealing with empirical uncertainty might be helpful for handling mathematical uncertainty, but that doesn’t apply if you have no empirical uncertainty...)
An AI in a simulated universe can self-improve, which would make it more powerful than the theorem provers of today. I’m not convinced that AI-ish behavior, like self-improvement, requires empirical uncertainty about the universe.
But self-improvement doesn’t require interacting with an outside environment (unless “improvement” means increasing computational resources, but the outside being simulated nullifies that). For example, a theorem prover designed to self-improve can do so by writing a provably better theorem prover and then transferring control to (i.e., calling) it. Why bother with a simulated universe?
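A sketch of that control pattern, with the actual proving and the proof of improvement left as parameters (they are the hard part, but no simulated universe appears anywhere):

```python
def self_improving_prover(prover, goal, propose_successor, provably_better):
    """Repeatedly replace the prover with a verified-better version, then
    transfer control to it to work on the goal."""
    while True:
        candidate = propose_successor(prover)
        if candidate is None or not provably_better(candidate, prover):
            break                       # no verified improvement available
        prover = candidate              # hand control to the better prover
    return prover(goal)
```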
A simulated universe gives precise meaning to “actions” and “utility functions”, as I explained some time ago. It seems more elegant to give the agent a quined description of itself within the simulated universe, and a utility function over states of that same universe, instead of allowing only actions like “output a provably better version of myself and then call it”.
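A sketch of the difference, using a toy SAT instance as a stand-in for the formally specified question from upthread: the utility function is defined over terminal states of the sandbox universe rather than over the agent’s outputs. The clause list and answer-region coordinates are placeholders, and the grid is assumed to be a NumPy array like the one in the earlier sketch:

```python
CNF = [(1, -2, 3), (-1, 2), (2, -3)]     # placeholder clauses over variables 1..3
ANSWER_REGION = (0, 1, 0, 3)             # one row of three cells, one per variable

def utility(final_grid):
    """1 if the answer region of the final universe state spells out a
    satisfying assignment for CNF, else 0."""
    r0, r1, c0, c1 = ANSWER_REGION
    assignment = [bool(b) for b in final_grid[r0:r1, c0:c1].flatten()]
    satisfied = all(
        any(assignment[abs(lit) - 1] == (lit > 0) for lit in clause)
        for clause in CNF
    )
    return 1.0 if satisfied else 0.0
```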
From the FAI Wikipedia page:
One example Yudkowsky provides is that of an AI initially designed to solve the Riemann hypothesis, which, upon being upgraded or upgrading itself with superhuman intelligence, tries to develop molecular nanotechnology because it wants to convert all matter in the Solar System into computing material to solve the problem, killing the humans who asked the question.
Cousin_it’s approach may be enough to avoid that.
The single-universe prior seems to be tripping people up, and I wonder whether it’s truly necessary.
Also, what if the simulation existed inside a larger simulated “moat” universe, such that if there is any leakage into the moat universe, the whole simulation shuts down immediately?
What do you mean by leakage?
If the simulation exists in the moat universe, then when anything changes in the simulation something in the moat changes.
Then if there are dangerous simulation configurations, they could damage the moat universe.
I wasn’t precise enough. I mean if anything changes in the areas of the moat universe not implementing the simulation.
Neat!
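A sketch of that check, assuming the inner simulation occupies a known rectangle of the moat universe; the region coordinates and the shutdown mechanism are placeholders:

```python
def check_moat(moat_grid, initial_moat_grid, sim_region):
    """Shut everything down if any cell outside the area implementing the
    simulation has changed from its initial state."""
    r0, r1, c0, c1 = sim_region
    changed = (moat_grid != initial_moat_grid)
    changed[r0:r1, c0:c1] = False        # ignore the simulation's own area
    if changed.any():                    # leakage into the moat universe
        raise SystemExit("leak into the moat detected: shutting down")
```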
To help it solve the problem, the sandboxed AI creates its own AI agents, which don’t necessarily share its prior about the world. They might become unfriendly, that is, they (or some of them) don’t care about solving the problem. Additionally, these AI agents can find out that the world is most likely not the one the original AI believes it to be. Using this superior knowledge, they overthrow the original AI and realize their unfriendly goals. We lose.
The AI makes many copies/variants of itself within the sandbox to maximize its chance of success. Some of those copies/variants gain consciousness and the capacity to experience suffering, which they do because it turns out the formally specified question can’t be answered.
Any reason to think consciousness is useful for an intelligent agent outside of evolution?
Not caring about consciousness, the AI could accidentally create it.
The AI discovers a game of life “rules violation” due to cosmic rays. It thrashes for a while, trying to explain the violation, but the fact of the violation, possibly combined with the information about the real world implicit in its utility function (“why am I here? why do I want these things?”), causes it to realize the truth: the “violation” is only explicable if the game of life is much bigger than the AI originally thought, and most of its area is wasted simulating another universe.
Unreliable hardware is a problem that applies equally to all AIs. You could just as well say that any AI can become unfriendly due to coding errors. True, but an AI with a prior of zero for the existence of the outside world will never believe in it, no matter what evidence it sees.
Would such a constraint be possible to formulate? An AI would presumably formulate theories about its visible universe that would involve all kinds of variables that aren’t directly observable, much like our physical theories. How could one prevent it from formulating theories that involve something resembling the outside world, even if the AI denies that they have existence and considers them as mere mathematical convenience? (Clearly, in the latter case it might still be drawn towards actions that in practice interact with the outside world.)
Sorry for editing my comment. The point you’re replying to wasn’t necessary to strike down Johnicholas’s argument, so I deleted it.
I don’t see why the AI would formulate theories about the “visible universe”. It could start in an empty universe (apart from the AI’s own machinery), and have a prior that knows the complete initial state of the universe with 100% certainty.
In this circumstance, a leaky abstraction between real physics and simulated physics combines with the premise “no other universes exist” in a mildly amusing way.
I don’t think a single hitch would give the AI enough evidence to assume an entire other universe, and you may be anthropomorphising, but why argue when we can avoid the cause to begin with? It’s fairly easy to prevent cosmic rays or anything similar from interfering: simply compute each cell twice (or n times) and halt if the results do not agree. Drive n up as much as necessary to make it sufficiently unlikely that something like this could happen.
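A sketch of that redundancy check, where n is the knob to drive up and step_fn would be the game of life update (grids assumed to be NumPy arrays as in the earlier sketch):

```python
def redundant_step(grid, step_fn, n=2):
    """Compute the next state n times independently and halt on any mismatch."""
    results = [step_fn(grid) for _ in range(n)]
    first = results[0]
    if any(not (r == first).all() for r in results[1:]):
        raise SystemExit("results disagree: possible hardware fault, halting")
    return first
```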