You have to specify the right thing for whom. And the AGI won’t know what it is for sure, in a realistic slow takeoff during the critical risk period. See my reply to Charlie above.
But yes, using the AGI's intelligence to help you issue good instructions is definitely a good idea. See my Instruction-following AGI is easier and more likely than value aligned AGI for more of the logic on why.
All non-omniscient agents make decisions with incomplete information. I don’t think this will change at any level of takeoff.
Sure, but my point here is that AGI will be only weakly superhuman during the critical risk period, so it will be highly uncertain, and human judgment is likely to continue to play a large role. Quite possibly to our detriment.