Akash comments on DeepMind’s “Frontier Safety Framework” is weak and unambitious

Akash 18 May 2024 19:04 UTC
2 points
0
I agree with ~all of your subpoints but it seems like we disagree in terms of the overall appraisal.

Thanks for explaining your overall reasoning though. Also big +1 that the internal deployment stuff is scary. I don’t think either lab has told me what protections they’re going to use for internally deploying dangerous (~ASL-4) systems, but the fact that Anthropic treats internal deployment like external deployment is a good sign. OpenAI at least acknowledges that internal deployment can be dangerous through its distinction between high risk (can be internally deployed) and critical risk (cannot be), but I agree that the thresholds are too high, particularly for model autonomy.