Any argument along the lines of “humanity is generally non-friendly” shows a generally pessimistic view of human-nature (just an observation)
I found this labelling distracting. Especially since when we are talking about “Friendly AI” humans are not even remotely friendly in the relevant sense. It isn’t anything to do with ‘pessimism’. Believing that humans are friendly in that sense would be flat out wrong.
I like the idea of the sandbox as a purely additional measure. But I wouldn’t remotely consider it safe. Not just because a superintelligence may find a bug in the system. Because humans are not secure. I more or less assume that the AI will find a way to convince the creators to release it into the ‘real world’.
Especially since when we are talking about “Friendly AI” humans are not even remotely friendly in the relevant sense
Point taken—Friendliness for an AI is a much higher standard than even idealized human morality. Fine. But to get to that Friendliness, you need to define CEV in the first place, so improving humans and evolving them forward is a route towards that.
But again I didn’t mean to imply we need to create perfect human-sims. Not even close. This is an additional measure.
I more or less assume that the AI will find a way to convince the creators to release it into the ‘real world’.
This is an unreasonable leap of faith if the AI doesn’t even believe that there are ‘creators’ in the first place.
I found this labelling distracting. Especially since when we are talking about “Friendly AI” humans are not even remotely friendly in the relevant sense. It isn’t anything to do with ‘pessimism’. Believing that humans are friendly in that sense would be flat out wrong.
I like the idea of the sandbox as a purely additional measure. But I wouldn’t remotely consider it safe. Not just because a superintelligence may find a bug in the system. Because humans are not secure. I more or less assume that the AI will find a way to convince the creators to release it into the ‘real world’.
Point taken—Friendliness for an AI is a much higher standard than even idealized human morality. Fine. But to get to that Friendliness, you need to define CEV in the first place, so improving humans and evolving them forward is a route towards that.
But again I didn’t mean to imply we need to create perfect human-sims. Not even close. This is an additional measure.
This is an unreasonable leap of faith if the AI doesn’t even believe that there are ‘creators’ in the first place.
Do you believe there are creators?