What you say about is/ought is basically the alignment problem, right? My take: I have high confidence that future AIs will know, intellectually, what humans regard as common-sense morality, since that knowledge is instrumentally useful for any goal that involves predicting or interacting with humans. I have less confidence that we’ll figure out how to make those AIs actually adopt human common-sense morality. Even humans, who probably have an innate drive to follow societal norms, sometimes violate them anyway, or do terrible things in ways that work around those constraints.