Regarding the 2004 comment, AGI Researcher was probably referring to the Coherent Extrapolated Volition document, which Eliezer marked as slightly obsolete in 2004; there has not been a word since then about any progress in the theory of Friendliness.
Robin, if you grant that a “hard takeoff” is possible, it follows that one will eventually become likely (humans being curious and inventive creatures). Such an AI would “rule the world” in the sense of having the power to do whatever it wants. Now, suppose you get to pick what it wants (and program that in). What would you pick? I can see arguing against the feasibility of a hard takeoff (I don’t buy it myself), but if you accept that step, Eliezer’s intentions seem correct.