The “surgical model edits” section should also have a subsection on editing model weights. For example, there’s this paper on removing knowledge from models using multi-objective weight masking.
Yep, no idea how I forgot this. Concept erasure!