Content warning – the idea below may increase your subjective estimate of personal s-risks.
If there is at least one aligned AI, other AIs may have an incentive to create s-risks for currently living humans – in order to blackmail the aligned AI. Thus, s-risk probabilities depend on the likelihood of a multipolar scenario.
Makes sense. What probability do you place on this? It would require solving alignment, a second AI being created before the first can form a singleton, and then the misaligned AI choosing this kind of blackmail over other possible tactics. If the blackmail involves sentient simulations (as is sometimes suggested, though not in your comment), the misaligned AI would seemingly have to solve the hard problem of consciousness and prove the simulations' sentience to the other AI (the threat is not credible if the simulations are not known to be sentient).
I think it becomes likely in a multipolar scenario with 10-100 AIs.
One thing to take into account is that other AIs will anticipate this risk and keep their real preferences secret. This means that which AIs are aligned will be unknowable, both for humans and for other AIs.