MiguelDev comments on LoRA Fine-tuning Efficiently Undoes Safety Training from Llama 2-Chat 70B

MiguelDev 15 Oct 2023 4:57 UTC
1 point
0
I’m exploring a path where AI systems can effectively use harmful technical information present in their training data. I believe that AI systems need to be aware of potential harm in order to protect themselves from it. We just need to figure out how to teach them this.