Minor correction. You’re saying: > So training a 1-million parameter model on 10 books takes about as many FLOPS as training a 10-million parameter model on one book.
You link to FLOP per second aka FLOPS, whereas you’re talking about the plural of FLOP, a quantity (often used is FLOPs).
Minor correction. You’re saying:
> So training a 1-million parameter model on 10 books takes about as many FLOPS as training a 10-million parameter model on one book.
You link to FLOP per second aka FLOPS, whereas you’re talking about the plural of FLOP, a quantity (often used is FLOPs).