TurnTrout comments on Debate on Instrumental Convergence between LeCun, Russell, Bengio, Zador, and More

TurnTrout 4 Oct 2019 20:21 UTC
LW: 27 AF: 8
AF
I don’t know that his arguments “echo”, it’s more like “can be translated into existing discourse”. For example, the leap from his 5) to IDA is massive, and I don’t understand why he imagines tackling the “we can’t align AGIs” problem with “build another AGI to stop the bad AGI”.
- Vaniver 4 Oct 2019 21:11 UTC
  LW: 1 AF: 1
  AF Parent
  I think 5 is much closer to the “look, the first goal is to build a system that prevents anyone else from building unaligned AGI” claim, and there’s a separate claim 6 of the form “more generally, we can use AGI to police AGI” that is similar to debate or IDA. And I think claim 5 is basically in line with what, say, Bostrom would discuss (where stabilization is a thing to do before we attempt to build a sovereign).
  - ESRogs 4 Oct 2019 23:24 UTC
    LW: 10 AF: 6
    AF Parent
    And I think claim 5 is basically in line with what, say, Bostrom would discuss (where stabilization is a thing to do before we attempt to build a sovereign).
    You mean in the sense of stabilizing the whole world? I’d be surprised if that’s what Yann had in mind. I took him just to mean building a specialized AI to be a check on a single other AI.
    - Vaniver 5 Oct 2019 2:45 UTC
      LW: 4 AF: 2
      AF Parent
      That’s how I interpreted:
      the defensive AI systems designed to protect against rogue AI systems are not akin to the military, they are akin to the police, to law enforcement. Their “jurisdiction” would be strictly AI systems, not humans.
      To be clear, I think he would mean it more in the way that there’s currently an international police order that is moderately difficult to circumvent, and that the same would be true for AGI, and not necessarily the more intense variants of stabilization (which are necessarily primarily if you think offense is highly advantaged over defense, which I don’t know his opinion on).