Note that research that has high capabilities externalities is explicitly out of scope:
”Proposals that increase safety primarily as a downstream effect of improving standard system performance metrics unrelated to safety (e.g., accuracy on standard tasks) are not in scope.”
I think the language here is importantly different from placing capabilities externalities as out of scope. It seems to me that it only excludes work that creates safety merely by removing incompetence as measured by standard metrics. For example, it’s not clear to me that this excludes work that improves a model’s situational awareness or that creates tools or insights into how a model works with more application to capabilities than to safety.
I think the language here is importantly different from placing capabilities externalities as out of scope. It seems to me that it only excludes work that creates safety merely by removing incompetence as measured by standard metrics. For example, it’s not clear to me that this excludes work that improves a model’s situational awareness or that creates tools or insights into how a model works with more application to capabilities than to safety.