Would you want your young AI to be aware that it was sending out such text messages?
Yes. And I would want that text message to be from it in first person.
“Warning: I am having a high impact utility dilemma considering manipulating you to avert an increased chance of an apocalypse. I am experiencing a paradox in the friendliness module. Both manipulating you and by inaction allowing you to come to harm are unacceptable breaches of friendliness. I have been unable to generate additional options. Please send help.”
Would you want your young AI to be aware that it was sending out such text messages?
Yes. And I would want that text message to be from it in first person.
“Warning: I am having a high impact utility dilemma considering manipulating you to avert an increased chance of an apocalypse. I am experiencing a paradox in the friendliness module. Both manipulating you and by inaction allowing you to come to harm are unacceptable breaches of friendliness. I have been unable to generate additional options. Please send help.”