FP8 training is important because it speeds up training compared to BF16, and most frontier labs use it.
DeepSeek-V3 is one example, and SemiAnalysis has claimed that most labs use FP8.