My guess is that we wouldn’t actually know with high confidence before (and likely even some time after) things-will-definitely-be-fine.E.g. 3 months after safe ASI people might still be publishing their alignment takes.
Oh, to be clear I’m not sure this is at all actually likely, but I was curious if anyone had explored the possibility conditional on it being likely
My guess is that we wouldn’t actually know with high confidence before (and likely even some time after) things-will-definitely-be-fine.
E.g. 3 months after safe ASI people might still be publishing their alignment takes.
Oh, to be clear I’m not sure this is at all actually likely, but I was curious if anyone had explored the possibility conditional on it being likely