1: It takes longer than a few hours to properly disagree with a post like this. 2: I’m not sure the comments here are an appropriate venue for debating such a disagreement.
I personally have a number of significant, specific disagreements with the post, primarily relating to the predictability and expected outcomes of inner misalignments and the most appropriate way of thinking about agency and value fragility. I’ve linked some comments I’ve made on those topics, but I think a better way to debate these sorts of questions is via a top level post specifically focusing on one area of disagreement.
1: Yeah I guess that’s true. And comments about smaller points are quicker to write up, explaining the fact that we see a bunch of those comments earlier on. But my intuition is that in 24-48 hours those sorts of meatier objections would usually surface.
2: Regardless of whether that is true, I would expect some people to find the OP an appropriate place to debate.
1: It takes longer than a few hours to properly disagree with a post like this.
2: I’m not sure the comments here are an appropriate venue for debating such a disagreement.
I personally have a number of significant, specific disagreements with the post, primarily relating to the predictability and expected outcomes of inner misalignments and the most appropriate way of thinking about agency and value fragility. I’ve linked some comments I’ve made on those topics, but I think a better way to debate these sorts of questions is via a top level post specifically focusing on one area of disagreement.
1: Yeah I guess that’s true. And comments about smaller points are quicker to write up, explaining the fact that we see a bunch of those comments earlier on. But my intuition is that in 24-48 hours those sorts of meatier objections would usually surface.
2: Regardless of whether that is true, I would expect some people to find the OP an appropriate place to debate.