Well, if we’ve fallen to the level of influencing other people’s votes by directly stating what the votes ought to say (ugh =/), then let me argue the opposite: This post – at least in its current state – should not have a positive rating.
I agree that the topic is interesting and important, but – as written – this could well be an example of what an AI with a twisted/incomplete understanding of suffering, entropy, and a bunch of other things has come up with. The text conjures several hells, both explicitly (Billions of years of suffering are the right choice!) and implicitly (We make our perfect world by re-writing people to conform! We know what the best version of you was, we know better than you and make your choices!), and the author seems to be completely unaware of that. We get surprising, unsettling conclusions with very little evidence or reasoning to support them (instead there are “reassuring” parentheticals like “(the answer is yes)”). As a “What could alignment failure look like?” case study this would be disturbingly convincing. As a serious post, the way it glosses over lots of important details and confidently presents its conclusions, combined with the “for easy referencing” in the intro, is just terrifying.
Hence: I don’t want anyone to make decisions based directly on this post’s claims, if those decisions might affect me even in the slightest. One of the clearest ways to signal that is with a negative karma score. (It doesn’t have to be multi-digit, but it shouldn’t be zero or greater.) Keep in mind that anyone on the internet (including GPT-5) can read this post, and they might interpret a positive score as endorsement / approval of the content as written. (They’re not guaranteed to know what the votes are supposed to mean, and it’s even plausible that someone uses the karma score as a filter criterion for some ML data collection.) Low positive scores can be rationalized away easily (e.g. the content is too advanced for most readers, other important stuff happening in parallel stole the show, …) or are likely to pass a filter cutoff, and zero is unstable and could accidentally flip into the positive numbers, so negative scores it is.