Nina Panickssery comments on Red-teaming language models via activation engineering