“Reward Tampering Problems and Solutions in Reinforcement Learning” describes how to do what you outlined.