Eliezer Yudkowsky comments on Does davidad’s uploading moonshot work?

Eliezer Yudkowsky 3 Nov 2023 21:19 UTC
19 points
2
I currently guess that a research community of non-upgraded alignment researchers with a hundred years to work, picks out a plausible-sounding non-solution and kills everyone at the end of the hundred years.
What links here?
- Upgrading the AI Safety Community by trevor (16 Dec 2023 15:34 UTC; 42 points)
- Bogdan Ionut Cirstea 16 Oct 2024 9:40 UTC
  2 points
  0
  Parent
  I’m highly confused about what is meant here. Is this supposed to be about the current distribution of alignment researchers? Is this also supposed to hold for e.g. the top 10 percentile of alignment researchers, e.g. on safety mindset? What about many uploads/copies of Eliezer only?
- Simon Fischer 15 Nov 2023 12:22 UTC
  1 point
  0
  Parent
  I’m confused by this statement. Are you assuming that AGI will definitely be built after the research time is over, using the most-plausible-sounding solution?
  Or do you believe that you understand NOW that a wide variety of approaches to alignment, including most of those that can be thought of by a community of non-upgraded alignment researchers (CNUAR) in a hundred years, will kill everyone and that in a hundred years the CNUAR will not understand this?
  If so, is this because you think you personally know better or do you predict the CNUAR will predictably update in the wrong direction? Would it matter if you got to choose the composition of the CNUAR?