The keywords for this concern are s-risk / astronomical suffering. I think this is unlikely, since a wrapper-mind that would pursue suffering thereby cares about human-specific concerns, which requires alignment-structure. So more likely this either isn’t systematically pursued (other than as incidental mindcrime, which allows suffering but doesn’t optimize for it), or we get full alignment (for whatever reason).
The keywords for this concern are s-risk / astronomical suffering. I think this is unlikely, since a wrapper-mind that would pursue suffering thereby cares about human-specific concerns, which requires alignment-structure. So more likely this either isn’t systematically pursued (other than as incidental mindcrime, which allows suffering but doesn’t optimize for it), or we get full alignment (for whatever reason).