I assure you that I have thought a lot about friendliness in AI. I just don’t think it is reasonable, or even possible, to give the AI a moral system from the very start. You can’t define morality well if the AI doesn’t already have a good understanding of the world. Of course it shouldn’t be taught too late either, but I actually think the risk is higher if you hardcode friendliness into the AI at the very beginning (where your definition is necessarily flawed, because you have so little to build it from) and then proceed on the assumption that the AI is friendly and will stay that way, than if you implement friendliness only once the AI actually understands the concepts involved. The difference would be like that between the moral understanding of a child and that of an adult philosopher.