Such oversaturation may result in two or more internally consistent but poorly compatible AI Safety theories (e.g. distillation-amplification vs. agent foundations), which will end up in two different friendly AIs, and the war for correct friendliness will start.
If we have two distinct AI safety plans, the sensible thing is for the researchers to have a big discussion about which one is better and only turn that one on. If not, and neither AI is fatally flawed, I would expect the two AIs to cooperate: they have very similar goals, and neither wants war.
Historically, that hasn't worked. The most bitter conflicts have been between the two main branches of, say, Islam (Shia and Sunni), or between different socialist groups. We could hope, though, that rationalists are better at cooperation.
Distillation-amplification, if it works, should only start a war if the amplified human would have wanted one. Agent foundations theorists, as far as I can tell, predict that the first mover in AI has a large enough strategic advantage that there will be nothing worthy of the word "war".
OK, so should the amplified human try to turn off the agent-foundations-based project first, since that project would turn off this human once complete?
If the amplified human could take over the world but hasn’t because he’s not evil, and predicts that this other AI system would do such evil, yes.
It's plausible, though, that the decision theory used by the new AI would tell it to act predictably non-evilly, so that the amplified human sees this coming and does not destroy the new AI before it is turned on.
Note that this amplified human has thereby already taken over the world in all but name.
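The decision-theoretic point above can be made concrete with a minimal toy sketch, assuming illustrative payoffs I've chosen myself (the numbers, the `AI_PAYOFFS` table, and the `human_response` / `best_policy` helpers are not from the discussion): if the new AI's policy is predictable to the amplified human, the AI does best by committing to a non-evil policy, because that is the only policy the human will allow to be switched on.

```python
# Toy sketch of the commitment argument, with made-up payoffs for illustration.
# The "new AI" picks a policy before activation; the amplified human can predict
# that policy and decides whether to destroy the project or let it run.

AI_PAYOFFS = {  # (AI policy, human action) -> payoff to the new AI
    ("evil",     "destroy"): 0,  # never turned on
    ("evil",     "allow"):   3,  # turned on and defects
    ("non_evil", "destroy"): 0,  # never turned on
    ("non_evil", "allow"):   2,  # turned on and cooperates
}

def human_response(ai_policy: str) -> str:
    """The amplified human, able to foresee the AI's policy, only allows a
    predictably non-evil AI to be switched on."""
    return "allow" if ai_policy == "non_evil" else "destroy"

def best_policy() -> str:
    """The AI chooses the policy maximizing its payoff *given* that the human
    foresees the policy before deciding."""
    return max(
        ("evil", "non_evil"),
        key=lambda policy: AI_PAYOFFS[(policy, human_response(policy))],
    )

if __name__ == "__main__":
    policy = best_policy()
    print(policy)                  # -> non_evil
    print(human_response(policy))  # -> allow
```

Under these assumed payoffs, predictably committing to non-evil is the AI's best move, which is the mechanism the comment above appeals to; whether real decision theories and real transparency assumptions deliver this is exactly what is in dispute.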