People have already been training models on doing CoT and similar techniques, certainly via fine-tuning, and I strongly suspect also at the “significant proportion of synthetic data in the training dataset” level. My impression (from the outside) is that it’s working well.
People have already been training models on doing CoT and similar techniques, certainly via fine-tuning, and I strongly suspect also at the “significant proportion of synthetic data in the training dataset” level. My impression (from the outside) is that it’s working well.