This recent comment thread discusses whether RLHF makes any progress beyond the classical “reward the agent when humans press the reward button” idea.