I think you and the linked post might be using different definitions of reward. You seem to use "reward" to mean what the AI values, whereas the linked post uses it to mean the reward function that the programmers specify and use to train the AI.