Say Alice has a problem with Bob but doesn’t know exactly what it is. Bob tries to fix it cooperatively by searching along dimension X for settings that alleviate Alice’s problem. If Alice’s problem is actually about Bob’s position on dimension Y, not X, Bob’s activity might appear adversarial: his actions are effectively Goodharting Alice’s sense of whether things are good, just as they would if he were actually trying to distract Alice from Y.