“the agent would lack a nuanced understanding of what we consider terrible”—isn’t it the whole narrative for Eliezer’s genie tales? While having #2 as a separate request is good, failure to follow #1 can still be catastrophic enough because computers think faster, so our formal “staying in control” may not matter enough.
“the agent would lack a nuanced understanding of what we consider terrible”—isn’t it the whole narrative for Eliezer’s genie tales? While having #2 as a separate request is good, failure to follow #1 can still be catastrophic enough because computers think faster, so our formal “staying in control” may not matter enough.