cherrvak comments on Pretraining Language Models with Human Preferences

cherrvak 23 Feb 2023 1:06 UTC
5 points
2
I thoroughly enjoyed this paper, and would very much like to see the same experiment performed on language models in the billion-parameter range. Would you expect the results to change, and if so, how?
- Tomek Korbak 24 Feb 2023 14:52 UTC
  2 points
  0
  Good question! We’re not sure. The fact that PHF scales well with dataset size might provide weak evidence that it would scale well with model size too.