All other things being equal, it seems safer to try to align AI with humans who are self-aligned.
For what it’s worth, I’ve concluded something similar, and it’s part of why I’m spending a decent chunk of my time trying to become such a person. Thankfully, the process has plenty of other things to recommend it, so this is just one of many reasons I’m doing it.