Okay, that’s fair enough.
In the context of Superintelligence, though, Table 11 defines a Tool AI as follows: ‘Tool: A system not designed to exhibit goal-directed behaviour.’ I am responding directly to that. But it sounds as though you would object to Bostrom’s characterisation of tool proposals.
In Bostrom’s parlance, I think your proposals for Tool AI would be described as (1) Oracle AI + stunting and (2) Oracle AI + boxing (the energy idea is interesting). I’m hopeful they would be safe, but I’m not convinced it would take much energy to pose an existential threat.
I also disagree with that conclusion, but I would probably say that ‘goals are dangerous’ is the false premise. Goals are dangerous when, well, they actually are dangerous (to my life or yours), and when they are attached to sufficient optimising power, as you get at in your last paragraph.
I think the line of argument Bostrom is taking here is that a superintelligence by definition has a huge amount of optimisation power, so the question of whether it is dangerous to us reduces to whether its goals are dangerous to us.
(Happy New Year!)