A list of potential miracles (including empirical “crucial considerations” [/wishful thinking] that could mean the problem is bypassed):
Possibility of a failed (unaligned) takeoff scenario where the AI fails to model humans accurately enough (i.e. it doesn’t realise that smart humans could detect its “hidden” activity). [This may only set things back a few months to years; or it could lead to some kind of Butlerian Jihad if there is a sufficiently bad (but ultimately recoverable) global catastrophe (which might then buy much more time for Alignment the second time around?)].
Valence realism being true. Binding problem vs AGI Alignment.
Omega experiencing every possible consciousness and picking the best? [Could still lead to x-risk in the form of a Hedonium Shockwave].
Moral Realism being true (and the AI discovering it and the true morality being human-compatible).
Natural abstractions leading to Alignment by Default?
Rohin’s links here.
AGI discovers new physics and exits to another dimension (like the creatures in Greg Egan’s Crystal Nights).
Simulation/anthropics stuff.
Alien Information Theory being true!? (And the aliens having solved alignment).