Long before goal mutation is a problem, malformed constraints become a problem.
True. But we are not arguing about which problem is bigger (or comes earlier). I’m being told that an AI cannot, absolutely can NOT, change its original goals (or terminal values). And that claim looks very handwavy to me.