If you want to better understand counting arguments for deceptive alignment, my comment here might be a good place to start.
If you want to better understand counting arguments for deceptive alignment, my comment here might be a good place to start.