Don’t you mean the dataset size was much too large for the smaller models and maybe too small for the largest models?