I don’t think I understand / buy this “race to the top” idea:
If adopted as a standard across frontier labs, we hope this might create a “race to the top” dynamic where competitive incentives are directly channeled into solving safety problems.
Some specific questions I have:
This sounds great, but what does it actually mean?
What’s a reasonable story for how this race to the top plays out?
Are there historical case studies of successful “races to the top” that the RSP is trying to emulate (or that can be looked to for reference)? There’s a bunch of related stuff, but it’s unclear to me what the best historical reference is.
What’s the main incentive pushing for better safety? Customers demanding it? Outside regulators? Or the company wanting to scale and needing to meet (mostly internally evaluated) safety thresholds first? That last one seems like the main mechanism in the RSP, while the others seem to have stronger historical precedent.
To be clear, I don’t think a historical precedent of similar things working is necessary for the RSP to be net positive, but it is (I think) a crux for how likely I believe this is to succeed.
Thanks for posting, Zac!