How do we know that the AI has a correct and reasonable understanding of the risk of harming a human? What if the AI has an incorrect definition of a human, or deliberately corrupts its definition of a human?