Rafael Harth comments on Does a LLM have a utility function?

Rafael Harth 15 Jan 2023 15:14 UTC
2 points
That’s the training signal, not the utility function. Those are different things. (I believe this point was made in “Reward is not the Optimization Target”, though I could be wrong, since I never actually read that post; corrections welcome.)