Okay, so we just have to determine human terminal values in detail, and plug them into a powerful maximizer.
Why do you even go around thinking that the concept of “terminal values”, which is basically just a consequentialist steelmanning Aristotle, cuts reality at the joints?
For starters, you want to be able to prove formally that its goals will remain stable as it self-modifies.
That part honestly isn’t that hard once you read the available literature about paradox theorems.