total compute needed for training = 3.82e12 flp/n/s * 4.78e8 bio_seconds * 1.3e9 neurons = 2.37e30 flops = 2.37e15 petaFLOPs
Should that be 3.82e15 flp/n/s, based on the numbers right above?
Yes, thanks for catching that.
edit: fixed
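For anyone who wants to re-check the multiplication, here is a quick sketch in Python. It only re-multiplies the three factors from the line quoted above. The constant and function names are made up for illustration, and the thread does not show which rate the fix settled on, so both candidates are evaluated.

```python
# Sanity check of the quoted estimate. Names here are illustrative only.
PETA = 1e15          # 1 petaFLOP = 1e15 FLOP
BIO_SECONDS = 4.78e8 # biological seconds, from the quoted line
NEURONS = 1.3e9      # neuron count, from the quoted line


def total_training_flop(flop_per_neuron_per_second):
    """Total FLOP = per-neuron rate * biological seconds * neuron count."""
    return flop_per_neuron_per_second * BIO_SECONDS * NEURONS


# Evaluate both the rate as quoted (3.82e12) and the rate suggested in the reply (3.82e15).
for rate in (3.82e12, 3.82e15):
    total = total_training_flop(rate)
    print(f"{rate:.2e} flp/n/s -> {total:.2e} FLOP = {total / PETA:.2e} petaFLOP")
```

The 2.37e30 total in the quoted line is what you get from 3.82e12. With 3.82e15 the same product comes out to about 2.37e33 FLOP, or 2.37e18 petaFLOP, so the rate and the total need to change together.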