I’d agree that the arguments I raise could be addressed (as endless arguments attest) and OP could reasonably end up with a thesis like “LLMs are actually human aligned by default”. Putting my recommendation differently, the lack of even a gesture towards those arguments almost caused me to dismiss the post as unserious and not worth finishing.
I’m somewhat surprised, given OP’s long LW tenure. Maybe this was written for a very different audience and just incidentally posted to LW? Except the linkpost tagline focuses on the first part of the post, not the second, implying OP thought the first part was actually persuasive?! Is OP failing an intellectual Turing test, or am I???
I agree with you that it is quite bad that Roko didn’t attempt to do this. My steelmanning doesn’t change the fact that the original argument is itself quite weak and should be shored up.