That does seem like a good axis for identifying cruxes of takeover risk. Though I think “how hard is world takeover” is mostly a function of the first two axes? If you think there are lots of tasks (e.g. creating a digital dictatorship, or any subtasks thereof) which are both possible and tractable, then you’ll probably end up pretty far along the “vulnerable” axis.
I also think the two axes alone are useful for identifying differences in world models, which can surface cruxes and interesting research or discussion topics, apart from any implications those world models have for AI takeover risk or anything else to do with AI specifically.
If you think, for example, that nanotech is relatively tractable, that might imply that you think there are promising avenues for anti-aging or other medical research that involve nanotech, AI-assisted or not.
Though I think “how hard is world takeover” is mostly a function of the first two axes?
I claim they're almost entirely orthogonal. Examples of concrete disagreements here are easy to find once you go looking:
If AGI tries to take over the world, everyone will coordinate to resist
Existing computer security works
Existing physical security works
I claim these don’t reduce cleanly to the form “it is possible to do [x]”. At a high level, they mostly come down to competing explanations for why the world is not on fire:
existing security measures prevent effectively (not a vulnerable world)
vs.
existing law enforcement discourages effectively (a vulnerable world)
existing people are mostly not evil (a vulnerable world)
There is some projection onto the “how feasible are things” axis, for capabilities where we don’t have very good existence proofs:
can an AI convince humans to perform illegal actions
can an AI write secure software to prevent a counter-coup
etc.
These are all much, much weaker than anything involving nanotechnology or other “indistinguishable from magic” scenarios.
And of course Meta makes everything worse. There was a presentation at Black Hat or DEF CON by one of their security people about how it’s easier to go after attackers than to close security holes. In this way they contribute to making the world more vulnerable. I’m having trouble finding it, though.