The top 6 ideas in the paper (the ones I think got >90% "somewhat agree" or "strongly agree", listed below) seem pretty similar to me. Are there important reasons people might support one over another?
Pre-deployment risk assessments
Evaluations of dangerous capabilities
Third-party model audits
Red teaming
Pre-training risk assessments
Pausing training of dangerous models
I think 19 ideas got >90% agreement.
I agree the top ideas overlap. I think the reasons one might support some over others depend on the details of how each is implemented.