Data order matters more at smaller scales: if you're training a small model on a lot of data and you sample in a sufficiently nonrandom manner, you should expect catastrophic forgetting to kick in eventually, especially if you use weight decay.
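To make this concrete, here is a minimal sketch (the synthetic tasks, model, and hyperparameters are my own assumptions for illustration, not from the source): a small MLP is trained on one task, then exclusively on a second task whose label depends on different input features. With weight decay, the first-layer weights the second task doesn't use get pulled toward zero on top of ordinary gradient interference, so accuracy on the first task typically collapses.

```python
# Hypothetical demonstration of catastrophic forgetting under nonrandom
# data ordering (task A, then only task B) with weight decay.
import torch
import torch.nn as nn

torch.manual_seed(0)

def make_task(active_dims, n=2000, dim=20):
    """Binary task whose label depends only on `active_dims` of the input."""
    x = torch.randn(n, dim)
    y = (x[:, active_dims].mean(dim=1) > 0).long()
    return x, y

def accuracy(model, x, y):
    with torch.no_grad():
        return (model(x).argmax(dim=1) == y).float().mean().item()

def train(model, opt, x, y, steps=1000):
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(steps):
        opt.zero_grad()
        loss_fn(model(x), y).backward()
        opt.step()

model = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 2))
# Nontrivial weight decay: during phase 2, the weights reading the
# task-A features receive no useful gradient and decay toward zero.
opt = torch.optim.AdamW(model.parameters(), lr=3e-3, weight_decay=0.3)

xa, ya = make_task(active_dims=list(range(0, 10)))   # task A: dims 0-9
xb, yb = make_task(active_dims=list(range(10, 20)))  # task B: dims 10-19

train(model, opt, xa, ya)                 # phase 1: task A only
print("task A acc after phase 1:", accuracy(model, xa, ya))

train(model, opt, xb, yb, steps=2000)     # phase 2: task B only, nonrandom order
print("task A acc after phase 2:", accuracy(model, xa, ya))  # typically degrades
print("task B acc after phase 2:", accuracy(model, xb, yb))
```

A randomly shuffled mixture of both tasks in a single training phase would, by contrast, let the model retain both; the forgetting here comes from the ordering, and the decay term speeds it up.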