Thanks for your reply. In brief response to your more specific points:
On government oversight, I think you’re referring to the quote “providing a regulator the power to oversee model development could also promote regulatory visibility, thus allowing regulations to adapt more quickly.” But the paper doesn’t seem to mention the direct benefit of oversight: verifying compliance and enforcing the rules. Good oversight would result in licensing not being a one-time thing but rather that labs could lose their licenses during a training run if they were noncompliant. (In my community ‘oversight of training runs’ means government auditors verifying compliance and the government stopping noncompliant runs; maybe it means something weaker outside my community.)
I agree that “perfect compliance” is hard but stand by my disappointment in the “particularly egregious instances” passage as not aiming high enough,
Thanks for your reply. In brief response to your more specific points:
On government oversight, I think you’re referring to the quote “providing a regulator the power to oversee model development could also promote regulatory visibility, thus allowing regulations to adapt more quickly.” But the paper doesn’t seem to mention the direct benefit of oversight: verifying compliance and enforcing the rules. Good oversight would result in licensing not being a one-time thing but rather that labs could lose their licenses during a training run if they were noncompliant. (In my community ‘oversight of training runs’ means government auditors verifying compliance and the government stopping noncompliant runs; maybe it means something weaker outside my community.)
I agree that “perfect compliance” is hard but stand by my disappointment in the “particularly egregious instances” passage as not aiming high enough,
Edit: also I get that finding consensus is hard but after reading the consensus-y-but-ambitious Towards Best Practices in AGI Safety and Governance and Model evaluation for extreme risks I was expecting consensus on something stronger.
Thanks for the response! I appreciate the clarification on both point 1 and 2 above. I think they’re fair criticisms. Thanks for pointing them out.