More thoughts that may or may not be directly relevant
What’s missing from my definition is that deception happens solely via “stepping in front of the camera”, i.e. via the regular sensory channels of the deceived optimizer, ie brainwashing or directly modifying memory is not deception
From this follows to deceive is to either cause a false pattern recognition or to prevent a correct one, and for this you indeed need familiarity with the victim’s perceptual categories
I’d like to say more re: hostile telepaths or other deception frameworks but am unsure what your working models are
Interesting, this implies a good deceiver has the power to determine another agent’s model and signal in a way that is aligned with the other’s model. I previously read an article on hostile telepaths https://www.lesswrong.com/posts/5FAnfAStc7birapMx/the-hostile-telepaths-problem which may be pertinent.
More thoughts that may or may not be directly relevant
What’s missing from my definition is that deception happens solely via “stepping in front of the camera”, i.e. via the regular sensory channels of the deceived optimizer, ie brainwashing or directly modifying memory is not deception
From this follows to deceive is to either cause a false pattern recognition or to prevent a correct one, and for this you indeed need familiarity with the victim’s perceptual categories
I’d like to say more re: hostile telepaths or other deception frameworks but am unsure what your working models are