I think the fuller context,
Anthropic has put WAY more effort into safety, way way more effort into making sure there are really high standards for safety and that there isn’t going to be danger what these AIs are doing
implies it’s just the amount of effort is larger than other companies (which I agree with), and not the Youtuber believing they’ve solved alignment or are doing enough, see:
but he’s also a realist and is like “AI is going to really potentially fuck up our world”
and
But he’s very realistic. There is a lot of bad shit that is going to happen with AI. I’m not denying that at all.
So I’m not confident that it’s “giving people a false impression of how good we are doing on actually making things safe.” in this case.
I do know DougDoug has recommended Anthropic’s Alignment Faking paper to another youtuber, which is more of a “stating a problem” paper than saying they’ve solved it.
I didn’t either, but on reflection it is!
I did change the post based off your comment, so thanks!