I would like to see more posts about reasoning and fewer, despite my transhumanist sympathies, about boxed AIs, hypothetical torture scenarios, and the optimality of donating to the Friendly AI cause
One of these things is not like the others. One of these things is not about the topic which historically could not be named. One of them is just a building block that can be sometimes useful when discussing reasoning that involves decision making.
My objection to that one is slightly different, yes. But I think it does derive from the same considerations of vast utility/disutility that drive the historically forbidden topic, and is subject to some of the same pitfalls (as well as some others less relevant here).
There are also a few specific torture scenarios which are much more closely linked to the historically forbidden topic, and which come up, however obliquely, with remarkable frequency.
There are also a few specific torture scenarios which are much more closely linked to the historically forbidden topic, and which come up, however obliquely, with remarkable frequency.
Hmm...
Roko’s Basilisk
Boxed AI trying to extort you
The ’People Are Jerks” failure mode of CEV
I can’t think of any other possible examples off the top of my head. were these the ones you were thinking of?
Also Pascal’s mugging (though I suppose how closely related that is to the HFT depends on where you place the emphasis) and a few rarer variations, but you’ve hit the main ones.
One of these things is not like the others. One of these things is not about the topic which historically could not be named. One of them is just a building block that can be sometimes useful when discussing reasoning that involves decision making.
My objection to that one is slightly different, yes. But I think it does derive from the same considerations of vast utility/disutility that drive the historically forbidden topic, and is subject to some of the same pitfalls (as well as some others less relevant here).
There are also a few specific torture scenarios which are much more closely linked to the historically forbidden topic, and which come up, however obliquely, with remarkable frequency.
Hmm...
Roko’s Basilisk
Boxed AI trying to extort you
The ’People Are Jerks” failure mode of CEV
I can’t think of any other possible examples off the top of my head. were these the ones you were thinking of?
Also Pascal’s mugging (though I suppose how closely related that is to the HFT depends on where you place the emphasis) and a few rarer variations, but you’ve hit the main ones.