Some people who are very concerned about suffering might be considering building an unaligned AI that kills everyone just to avoid the risk of an AI takeover by an AI aligned to values which want some people to suffer.
Let me be on the record saying: I believe the probability of {alignment to values that strongly diswant suffering for all moral patients} is high enough, and the probability of {alignment to values that want some moral patients to suffer} is low enough, that this action is not worth it.
I think this applies to approximately anyone who would read this post, including heads of major labs, in case they happen to read it and are pursuing the strategy of killing everyone to reduce S-risk.
See also: how acausal trade helps in 1, 2, though I think this even without acausal trade.
I’ve replied to / written up my current beliefs about this subject here.