OK:
You should publicly confirm that your old policy, “don’t meaningfully advance the frontier with a public launch,” has been replaced by your RSP, if that’s true, and otherwise clarify your policy.
You take credit for the LTBT (e.g. here) but you haven’t published enough to show that it’s effective. You should publish the Trust Agreement, clarify these ambiguities, and make accountability-y commitments like “if major changes happen to the LTBT, we’ll quickly tell the public.”
(Reminder that a year ago you committed to establish a bug bounty program (for model issues) or similar but haven’t. But I don’t think bug bounties are super important.)
[Edit: bug bounties are also mentioned in your RSP—in association with ASL-2—but not explicitly committed to.]
(Good job in many areas.)
(Sidenote: it seems Sam was kind of explicitly asking to be pressured, so your comment seems legit :)
But I also think that, had Sam not done so, I would still really appreciate him showing up and responding to Oli’s top-level post, and I think it should be fine for folks from companies to show up and engage with the topic at hand (NDAs), without also having to do a general AMA about all kinds of other aspects of their strategy and policies. If Zach’s questions do get very upvoted, though, it might suggest there’s demand for some kind of Anthropic AMA event.)