Hmm. Perhaps the thing I’d endorse is more [include this in every detailed statement about policy/regulation], rather than [shout it from the rooftops].
So, for example, if the authors agree with the statement, I think this should be in:
ARC Evals’ RSP post.
Every RSP.
Proposals for regulation.
...
I’m fine if we don’t start printing it on bumper stickers.
The outcome I’m interested in is something like: every person with significant influence on policy knows that this is believed to be a good/ideal solution, and that the only reasons against it are based on whether it’s achievable in the right form.
If ARC Evals aren’t saying this, RSPs don’t include it, and many policy proposals don’t include it..., then I don’t expect this to become common knowledge. We’re much less likely to get a stop if most people with influence don’t even realize it’s the thing that we’d ideally get.
Hmm. Perhaps the thing I’d endorse is more [include this in every detailed statement about policy/regulation], rather than [shout it from the rooftops].
So, for example, if the authors agree with the statement, I think this should be in:
ARC Evals’ RSP post.
Every RSP.
Proposals for regulation.
...
I’m fine if we don’t start printing it on bumper stickers.
The outcome I’m interested in is something like: every person with significant influence on policy knows that this is believed to be a good/ideal solution, and that the only reasons against it are based on whether it’s achievable in the right form.
If ARC Evals aren’t saying this, RSPs don’t include it, and many policy proposals don’t include it..., then I don’t expect this to become common knowledge.
We’re much less likely to get a stop if most people with influence don’t even realize it’s the thing that we’d ideally get.