Here’s one potential reason why this works and a list of neuroscience papers which empirically show linearity between LLMs and human linguistic representations.
Given the deep similarities between biological nets and LLMs, I wonder if a technique similar to “activation engineering” could be used for robust mind control and/or brainwashing.
Here’s one potential reason why this works and a list of neuroscience papers which empirically show linearity between LLMs and human linguistic representations.
Given the deep similarities between biological nets and LLMs, I wonder if a technique similar to “activation engineering” could be used for robust mind control and/or brainwashing.