I have a slightly different take, which is that we can’t commit to doing this scheme even if we want to, because I don’t see anything we can do today that would warrant the term “commitment”, i.e., that would be binding on our post-singularity selves.
In either case (we can’t or don’t commit), the argument in the OP loses a lot of its force, because we don’t know whether post-singularity humans will decide to do this kind of scheme or not.
A young unaligned AI will also not know whether post-singularity humans will honor the commitment, so suppose it estimates that probability at 0.5. Even at those odds, the young AI can still prefer the deal, as long as the expected payout from cooperating exceeds its expected payoff from attempting takeover.
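For concreteness, here is a minimal expected-value sketch; the symbols are my own illustrative assumptions, not anything from the OP. Let $p$ be the AI's credence that post-singularity humans honor the deal, $R$ the payout it receives if they do, $q$ its probability of a successful takeover, and $T$ the takeover payoff. Cooperating beats defecting when

$$p \cdot R \;>\; q \cdot T \quad\Longleftrightarrow\quad p \;>\; \frac{qT}{R}.$$

At $p = 0.5$ the deal remains attractive whenever $R > 2qT$, so a coin-flip credence in the commitment suffices if the young AI thinks its takeover odds $q$ are small relative to the offered payout.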