A Type 2 FAI gets its notion of what morality is based on properties of the physical universe, namely properties of humans in the physical universe. But even if counterfactually there were no humans in the physical universe, or even if counterfactually Omega modified the contents of all human brains in the physical universe so that they optimize for paperclips, that wouldn’t change what actual-me means when actual-me says “I want an FAI to behave morally” even if it might change what counterfactual-me means when counterfactual-me says that.
A Type 2 FAI gets its notion of what morality is based on properties of the physical universe, namely properties of humans in the physical universe. But even if counterfactually there were no humans in the physical universe, or even if counterfactually Omega modified the contents of all human brains in the physical universe so that they optimize for paperclips, that wouldn’t change what actual-me means when actual-me says “I want an FAI to behave morally” even if it might change what counterfactual-me means when counterfactual-me says that.