Human values evolve in human ways. A priori, an AI’s value drift would almost surely take it in alien, worthless-to-us directions. A non-evolving AI sounds easier to align—we only need to hit the human-aligned region of valuespace once instead of needing to keep hitting it.