I don’t think the argument is that AI would be fundamentally different, but rather that “we can reason at least somewhat reliably when making predictions about agents who don’t drastically self-modify, and about whom we have thousands of years of data on which to build our predictions” isn’t good enough to deal with a drastically self-modifying agent, one that could exhibit entirely novel behavior and cognitive dynamics even if it weren’t capable of self-modifying. “Somewhat reliably” is fine only as long as a single failure isn’t enough to throw all the rest of your predictions into the trash bin.
I don’t know enough about your second example to feel confident commenting on it.
“Somewhat reliably” is fine only as long as a single failure isn’t enough to throw all the rest of your predictions into the trash bin.
Humans seem pretty good at making correct predictions even if they have made incorrect predictions in the past. More generally, any agent for whom a single wrong prediction throws everything into disarray will probably not continue to function for very long.
I don’t know enough about your second example to feel confident commenting on it.
Fair enough. This is an admirable habit that is all too rare, so have an upvote :).
Humans seem pretty good at making correct predictions even if they have made incorrect predictions in the past. More generally, any agent for whom a single wrong prediction throws everything into disarray will probably not continue to function for very long.
That’s basically my point. A human has to predict the answers to questions like “what would I do in situation X”, and their overall behavior is the sum of their actions across all situations, so they can still get the overall result roughly right as long as they are correct on average. An AI that’s capable of self-modification also has to predict the answers to questions like “how would my behavior be affected if I modified my decision-making algorithm in this way”, where the answer influences not just its behavior in one situation but in all the ones that follow. The effects of individual decisions become global rather than local. It needs to make much more reliable predictions if it is to have any chance of even remaining basically operational over the long term.
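To make the local-versus-global distinction concrete, here’s a toy simulation (purely my own illustration, with made-up numbers, not anything from the original argument): both agents make every individual prediction with the same 90% reliability, but for the self-modifying agent a single wrong prediction at a modification step is modelled, crudely, as permanently dropping its per-decision accuracy to chance.

```python
import random

def run_local(n, p, rng):
    """Errors stay local: each wrong prediction costs only that one
    decision, so long-run performance converges to p regardless of
    how many individual mistakes accumulate."""
    return sum(rng.random() < p for _ in range(n)) / n

def run_self_modifying(n, p, mods, rng):
    """Errors go global: at each of `mods` evenly spaced
    self-modifications, a wrong prediction (probability 1 - p, the
    same per-prediction reliability as above) is modelled as
    permanently dropping per-decision accuracy to chance (0.5)."""
    mod_points = {(j + 1) * n // (mods + 1) for j in range(mods)}
    acc, correct = p, 0
    for i in range(n):
        correct += rng.random() < acc
        if i in mod_points and rng.random() > p:
            acc = 0.5  # one bad self-modification corrupts everything after
    return correct / n

rng = random.Random(0)
trials, n, p = 2000, 1000, 0.9
print("errors local only:      ",
      sum(run_local(n, p, rng) for _ in range(trials)) / trials)
print("10 self-modifications:  ",
      sum(run_self_modifying(n, p, 10, rng) for _ in range(trials)) / trials)
```

Running this, the non-self-modifying agent averages around 0.9, as expected, while the self-modifying one does noticeably worse even though its per-prediction reliability is identical: the chance of getting all ten modifications right is only 0.9^10 ≈ 35%, and every failure poisons everything downstream.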
Fair enough. This is an admirable habit that is all too rare, so have an upvote :).
Thanks. :)
And more important, its creators want to be sure that it will be very reliable before they switch it on.