I got some great constructive feedback from Linda Linsefors (which she gave me permission to share).
On the summary, Linda thinks this is not a good summary. In short, she thinks it highlights some of the weakest parts of the paper, and undersells the most important parts of the paper (eg. survey of impossibility arguments from other academic fields).
Also, that there is too much coverage of generic arguments about AI Safety in the summary. Those arguments make sense in the original post, given the expected audience. But those comments do not make sense for LW.
- E.g. this point at the start: “But the reality is that the chances of misaligned AGI are not small. In fact, in the absence of an effective safety program that is the only outcome we will get. So in reality the statistics look very convincing to support a significant AI safety effort.”
Overall, Linda expects this blogpost to make people less interested in Roman’s work. She is not surprised by the engagement on the post – one comment that has more upvotes than the original post.
I got some great constructive feedback from Linda Linsefors (which she gave me permission to share).
On the summary, Linda thinks this is not a good summary. In short, she thinks it highlights some of the weakest parts of the paper, and undersells the most important parts of the paper (eg. survey of impossibility arguments from other academic fields).
Also, that there is too much coverage of generic arguments about AI Safety in the summary. Those arguments make sense in the original post, given the expected audience. But those comments do not make sense for LW.
- E.g. this point at the start: “But the reality is that the chances of misaligned AGI are not small. In fact, in the absence of an effective safety program that is the only outcome we will get. So in reality the statistics look very convincing to support a significant AI safety effort.”
Overall, Linda expects this blogpost to make people less interested in Roman’s work. She is not surprised by the engagement on the post – one comment that has more upvotes than the original post.