TurnTrout comments on TurnTrout’s shortform feed

TurnTrout 5 Feb 2024 19:24 UTC
LW: 4 AF: 4
2
I’ve seen mixed evidence on how important curricula are for deep learning. One paper (on CIFAR) suggested that curricula help only when you have very few datapoints or the labels are noisy. But possibly that doesn’t generalize to LLMs.
- ryan_greenblatt 5 Feb 2024 20:23 UTC
  LW: 6 AF: 6
  4
  I think data ordering basically never matters for LLM pretraining. (That is, random ordering is best, and trying to impose a more specific order doesn’t help.)
  - Daniel Kokotajlo 5 Feb 2024 21:22 UTC
    LW: 2 AF: 2
    −1
    That was my impression too.