Kei comments on Reward hacking behavior can generalize across tasks