OK, but there’s a difference between “here’s a definition of manipulation that’s so waterproof you couldn’t break it if you optimized against it with arbitrarily large optimization power” and “here’s my current best way of thinking about manipulation.” I was presenting the latter, because it helps me be less confused than if I just stuck to my previous gut-level, intuitive understanding of manipulation.
Edit: Put otherwise, I was replying more to your point (1) than your point (2) in the original comment. Sorry for the ambiguity!
OK, but there’s a difference between “here’s a definition of manipulation that’s so waterproof you couldn’t break it if you optimized against it with arbitrarily large optimization power” and “here’s my current best way of thinking about manipulation.” I was presenting the latter, because it helps me be less confused than if I just stuck to my previous gut-level, intuitive understanding of manipulation.
Edit: Put otherwise, I was replying more to your point (1) than your point (2) in the original comment. Sorry for the ambiguity!