Doesn’t releasing the weights inherently involve releasing the architecture (unless you’re using some kind of encrypted ML)? A closed-source model could release the architecture details as well, but one step at a time. Just to be clear, I’m trying to push things towards a policy that makes sense going forward and so even if what you said about not providing any interesting architectural insight is true, I still think we need to push these groups to defining a point at which they’re going to stop releasing open models.
Doesn’t releasing the weights inherently involve releasing the architecture (unless you’re using some kind of encrypted ML)? A closed-source model could release the architecture details as well, but one step at a time. Just to be clear, I’m trying to push things towards a policy that makes sense going forward and so even if what you said about not providing any interesting architectural insight is true, I still think we need to push these groups to defining a point at which they’re going to stop releasing open models.