This post in which I use a probe to remove sleeper agent behavior in a toy model may also be of interest: https://www.col-ex.org/posts/sleeper-agent/
This post in which I use a probe to remove sleeper agent behavior in a toy model may also be of interest: https://www.col-ex.org/posts/sleeper-agent/