Bogdan Ionut Cirstea comments on Steering GPT-2-XL by adding an activation vector

Bogdan Ionut Cirstea 13 May 2023 23:04 UTC
9 points
Here’s one potential reason why this works, along with a list of neuroscience papers that empirically show linear mappings between LLM representations and human linguistic representations.
- RomanS 15 May 2023 15:09 UTC
  1 point
  Given the deep similarities between biological nets and LLMs, I wonder if a technique similar to “activation engineering” could be used for robust mind control and/or brainwashing.