Ann comments on Reward hacking behavior can generalize across tasks