Oh, so when steering the LAT model at layer 4, the model actually generates a valid response without refusing?