Simon Lermen comments on "LoRA Fine-tuning Efficiently Undoes Safety Training from Llama 2-Chat 70B"
Simon Lermen
3 Nov 2023 12:34 UTC
1 point
There is a paper out on the exact phenomenon you noticed:
https://arxiv.org/abs/2310.03693