I do think that some term needs to refer to this problem, to separate it from other problems like “understanding what humans want,” “solving philosophy,” etc.
Worth noting here that (it looks like) Paul eventually settled upon “intent alignment” as the term for this.
Worth noting here that (it looks like) Paul eventually settled upon “intent alignment” as the term for this.