I basically agree with this.
Or I’d put 20% chance on us being in the worlds “where superalignment doesn’t require strong technical philosophy”, that’s maybe not very low.
Overall I think the existance of Anthropic is a mild net positive, and the only lab for which this is true (major in the sense of building frontier models).
“the existence of” meaning, if they shut down today or 2 years ago, it would’ve not increased our chance of survival, maybe lowered it.
I’m also somewhat more optimistic about the research they’re doing helping us in the case where alignment is actually hard.
Why not just use resting heartrate? That also has very good empirical backing as a good proxy for overall heatlh, and its much easier to measure.