Chris_Leong comments on RAND report finds no effect of current LLMs on viability of bioterrorism attacks

Chris_Leong 26 Jan 2024 2:56 UTC
22 points
16
Having just read through this, one key point that I haven’t seen people mentioning is that the results are for LLM’s that need to be jail-broken.
So these results are more relevant to the release of a model over an API rather than open-source, where you’d just fine-tune away the safeguards or download a model without safeguards in the first place.