Having just read through this, one key point that I haven’t seen people mentioning is that the results are for LLM’s that need to be jail-broken.
So these results are more relevant to the release of a model over an API rather than open-source, where you’d just fine-tune away the safeguards or download a model without safeguards in the first place.
Having just read through this, one key point that I haven’t seen people mentioning is that the results are for LLM’s that need to be jail-broken.
So these results are more relevant to the release of a model over an API rather than open-source, where you’d just fine-tune away the safeguards or download a model without safeguards in the first place.