I think that people who imagine that “tracking progress using signals integrated with other signals” feels anything like happiness feels to them from the inside—while holding that imagination and also loudly insisting that it will be very alien happiness or much simpler happiness or whatever—are simply making a mistake-of-fact, and I am just plain skeptical that there is a real values difference there that would survive their learning what I know about how minds and qualia work. I of course fully expect these people to loudly proclaim that I could not possibly know anything they don’t, despite their own confusion about these matters, which they lack the skill to reflect on as confusion, and to exchange some wise smiles about those silly people who think that people disagree because of mistakes rather than values differences.
Trade opportunities are unfortunately ruled out by our inability to model those minds well enough that, if some part of them decided to seize an opportunity to Defect, we would’ve seen it coming in the past and counter-Defected. If we Cooperate, we’ll be nothing but CooperateBot, and they, I’m afraid, will be PrudentBot, not FairBot.
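For readers who haven’t run into the CooperateBot/FairBot/PrudentBot distinction before, here is a minimal toy sketch of it in Python. This is not the actual construction from the open-source game theory work: the real FairBot and PrudentBot reason about provability and lean on Löb’s theorem, whereas this sketch (all names and the depth parameter are illustrative inventions) replaces proof search with bounded simulation. It only illustrates the one contrast at issue in the comment above: FairBot rewards an unconditional cooperator, while PrudentBot exploits it.

```python
# Toy sketch under the assumptions stated above: "provability" is replaced by
# bounded simulation, which loses the Löbian reasoning of the real modal agents
# (e.g. this FairBot will NOT cooperate with a copy of itself), but it does
# reproduce how each agent treats an unconditional cooperator.

C, D = "C", "D"

def cooperate_bot(opponent, depth):
    """Cooperates unconditionally, no matter who the opponent is."""
    return C

def defect_bot(opponent, depth):
    """Defects unconditionally."""
    return D

def fair_bot(opponent, depth):
    """Cooperates iff bounded simulation says the opponent cooperates with it."""
    if depth == 0:
        return D  # simulation budget exhausted: fail safe to defection
    return C if opponent(fair_bot, depth - 1) == C else D

def prudent_bot(opponent, depth):
    """Cooperates iff the opponent cooperates with it AND the opponent
    defects against DefectBot, i.e. the opponent is not exploitable."""
    if depth == 0:
        return D
    cooperates_with_me = opponent(prudent_bot, depth - 1) == C
    not_exploitable = opponent(defect_bot, depth - 1) == D
    return C if (cooperates_with_me and not_exploitable) else D

if __name__ == "__main__":
    print("FairBot vs CooperateBot:   ", fair_bot(cooperate_bot, depth=3))     # C
    print("PrudentBot vs CooperateBot:", prudent_bot(cooperate_bot, depth=3))  # D
```

The contrast being invoked: a PrudentBot-style counterpart exploits anything that cooperates unconditionally, which is why being “nothing but CooperateBot” buys you nothing here.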
This shouldn’t matter for anyone besides me, but there’s something personally heartbreaking about seeing the one bit of research for which I feel comfortable claiming a fraction of a point of dignity being validly cited to argue why decision theory won’t save us.
(Modal bargaining agents didn’t turn out to be helpful, but given the state of knowledge at that time, it was worth doing.)
Sorry.
It would be dying with a lot less dignity if everyone on Earth—not just the managers of the AGI company making the decision to kill us—thought that all you needed to do was be CooperateBot, and had no words for any sharper concepts than that. Thank you for that, Patrick.
But sorry anyways.
To clarify, you mean “mistake-of-fact” in the sense that those same people might also apply to other high-level concepts? Because at a low enough resolution, happiness is like “tracking progress using signals integrated with other signals”, so it is at least not inconsistent to preserve this part of your utility function at such a low resolution.