Paperclipping is a relatively simple failure. The difference between paperclipping and evil is mainly just that: a matter of complexity. Evil is complex; turning the universe into tuna is decidedly not.
Ironically, on the scale of friendliness, I see an “evil” failure (meaning, among other things, that we’re still in some sense around to notice it being evil) becoming more likely as friendliness increases. As we try to implement our own values, failures become more complex and less likely to be total, thus letting us stick around to see them.