I (a member of Team Shard) think that there are plausible worlds where we don’t address any of the “core difficulties” of alignment, but survive anyways, just because making an AI aligned enough that it doesn’t literally want to kill everyone turns out to be a very low bar.
Great, glad to know it isn’t universal.
Same, and I think that this kind of “just don’t do anything actively stupid” world is actually reasonably likely (10%+).