Owain_Evans comments on GPT learning from smarter texts?

Owain_Evans 11 Jan 2023 16:18 UTC
2 points
0
See the Galatica model (https://arxiv.org/abs/2211.09085) from Meta. It’s trained on a curated dataset of scientific papers, reference materials and scientific knowledge bases (with only a very small % of random internet text). IIRC the benefits of this seem limited (better to train on a bigger dataset and use other techniques to make the model access the sciencey parts of the training set).