But equivalently a reward function can also penalise unauthorised power-gaining, given equal ability to notice it by the supervisors in both cases.
This is likely the crux of our disagreement, but I don’t have time to reply ATM. Hope to return to this.
This is likely the crux of our disagreement, but I don’t have time to reply ATM. Hope to return to this.