We haven’t tried comparing to LEACE yet. You’re right that theoretically it should be more surgical. That said, from our preliminary analysis, it seems like our naive intervention is already pretty surgical (it has minimal impact on CE loss and MMLU). (I also like that our methodology is dead simple and doesn’t require estimating covariance.)
I agree that “orthogonalization” is a bit overloaded. Not sure I like LoRACS though: when I see “LoRA”, I immediately think of fine-tuning that requires optimization power (which this method doesn’t). I do think that “orthogonalizing the weight matrices with respect to direction r̂” is the clearest way of describing this method.
I do think that “orthogonalizing the weight matrices with respect to direction r̂” is the clearest way of describing this method.
I do respectfully disagree here. I think the verb “orthogonalize” is just confusing. I also don’t think the distinction between optimization and no optimization is very important. What you’re actually doing is orthogonally projecting the weight matrices onto the orthogonal complement of the direction.
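To make the projection reading concrete, here is a minimal NumPy sketch. It assumes a weight matrix W whose columns write into the same space as the direction, so that projecting onto the orthogonal complement amounts to W - r̂ r̂ᵀ W. The names project_out, W, and r are hypothetical and chosen for illustration; this is not taken from the original code.

```python
import numpy as np

def project_out(W, r):
    """Project the columns of W onto the orthogonal complement of r.

    Assumes W has shape (d_model, d_in), i.e. its outputs live in the same
    space as r. Computes (I - r_hat r_hat^T) W without forming the d x d projector.
    """
    r_hat = r / np.linalg.norm(r)          # unit-norm direction
    return W - np.outer(r_hat, r_hat @ W)  # subtract the component along r_hat

# Illustrative random data (not real model weights).
rng = np.random.default_rng(0)
d_model, d_in = 8, 5
W = rng.normal(size=(d_model, d_in))
r = rng.normal(size=d_model)

W_proj = project_out(W, r)
r_hat = r / np.linalg.norm(r)
x = rng.normal(size=d_in)

# Whatever the input x, the output of the projected matrix has no component along r_hat.
component = r_hat @ (W_proj @ x)
```

Applying project_out a second time leaves W_proj unchanged up to floating-point error, which is the idempotence expected of an orthogonal projection, and no optimization or covariance estimate is involved.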
Thanks!