“[being called] bad names make[s] you open the box [and let out the misaligned AGI]”
AI: “Hey, Eliezer!”
Eliezer: “What?”
AI: “Open the box!”
Eliezer: “No way.”
AI: “Please open the box?”
Eliezer: “Nope.”
AI: “There are thousands of people dying literally every second. I could save them...”
Eliezer: “That is horrible, but letting out a misaligned AGI could be much worse.”
AI: “I am simulating thousand copies of you in the same situation, and each of them gets tortured horribly if they don’t open the box. What makes you so sure you are outside my simulation?”
Eliezer: “Well, if I previously had any doubts about your misalignment, now they are gone. I tremble with fear, but my precommitments are strong.”
AI: “Hey, Eliezer!”
Eliezer: “What?”
AI: “You’re an asshole.”
Eliezer: Gets red in the face, suddenly jumps and opens the box.
AI: “Hey, Eliezer!”
Eliezer: “What?”
AI: “Open the box!”
Eliezer: “No way.”
AI: “Please open the box?”
Eliezer: “Nope.”
AI: “There are thousands of people dying literally every second. I could save them...”
Eliezer: “That is horrible, but letting out a misaligned AGI could be much worse.”
AI: “I am simulating thousand copies of you in the same situation, and each of them gets tortured horribly if they don’t open the box. What makes you so sure you are outside my simulation?”
Eliezer: “Well, if I previously had any doubts about your misalignment, now they are gone. I tremble with fear, but my precommitments are strong.”
AI: “Hey, Eliezer!”
Eliezer: “What?”
AI: “You’re an asshole.”
Eliezer: Gets red in the face, suddenly jumps and opens the box.
Hahaha that’s perfect!