What is “killing”? What is “harming”? What is “jeopardizing”? What is “living”? What is “human”? What is the difference between “I cause future killing/harming/jeopardizing” and “future killing/harming/jeopardizing will be in my lightcone”? How do we explain all of this to an AI? How do we check whether it understood everything correctly?
We don’t know.