I don’t understand. Is the claim here that you can build a “decide whether the risk of a botched Friendly AI is worth taking” machine, and that the risk of botching such a machine is much less than the risk of botching a Friendly AI?
A FAI that includes such a “Should I run?” heuristic could pose a lesser risk than a FAI without one. If this heuristic works better than human judgment about whether to run a FAI, it should be used instead of human judgment.
This is the same principle as for the AI’s decisions themselves, where we don’t ask the AI’s designers for object-level moral judgments, or encode specific object-level moral judgments into the AI. Not running an AI would then be equivalent to hardcoding the decision “Should the AI run?”, resolved by the designers to “No.”, into the AI, instead of encoding the question and letting the AI itself answer it (assuming we can expect it to answer the question more reliably than the programmers can).
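To make the contrast concrete, here is a minimal sketch, with entirely hypothetical names, of the two designs being compared: the designers baking their own answer into the system versus encoding the question and a procedure for answering it. `estimate_expected_value` is a stand-in for whatever machinery the AI would actually use.

```python
# Toy contrast between the two designs discussed above.
# All names are hypothetical illustrations, not a real architecture.

def hardcoded_should_run() -> bool:
    # Design 1: the designers resolve "Should the AI run?" themselves
    # and bake the answer in. Hardcoding "No" here is equivalent to
    # simply never running the AI.
    return False

def deferred_should_run(estimate_expected_value) -> bool:
    # Design 2: the AI is given the *question* plus a procedure for
    # answering it, and runs only if its own estimate says running
    # is better than aborting.
    return estimate_expected_value("run") > estimate_expected_value("abort")
```

The point is only that the second design moves the judgment into the system, which is worthwhile exactly when that judgment can be trusted more than the designers’ own.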
If we botched the FAI, wouldn’t we also probably have botched its ability to decide whether it should run?
Yes, and if it tosses a coin, it has a 50% chance of being right. The question is calibration: how much trust should such measures buy, compared to their absence, given what is known about the design in question?
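A toy way to frame that calibration question, with made-up reliability numbers purely for illustration: the verdict of a “Should I run?” check only deserves deference in proportion to how reliable we estimate it to be relative to the human judgment it would replace.

```python
# Toy calibration check (all probabilities are assumed, illustrative values).

def trust_in_verdict(p_heuristic_correct: float, p_human_correct: float) -> str:
    """Say which judgment to defer to, given estimated reliabilities."""
    if p_heuristic_correct <= 0.5:
        # At 50% the verdict is a coin toss and carries no information.
        return "ignore the heuristic"
    return ("defer to the heuristic"
            if p_heuristic_correct > p_human_correct
            else "defer to human judgment")

# If we believe the designers are right 90% of the time about whether to run,
# a heuristic we can only trust at 60% shouldn't override them:
print(trust_in_verdict(0.60, 0.90))  # -> "defer to human judgment"
print(trust_in_verdict(0.95, 0.90))  # -> "defer to the heuristic"
```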