hold_my_fish comments on Thoughts on hardware / compute requirements for AGI

hold_my_fish 24 Jan 2023 20:55 UTC
LW: 3 AF: 3
0
AF
NTK training requires training time that scales quadratically with the number of training examples, so it’s not usable for large training datasets (nor with data augmentation, since that simulates a larger dataset). (I’m not an NTK expert, but, from what I understand, this quadratic growth is not easy to get rid of.)

hold_my_fish comments on Thoughts on hardware /​ compute requirements for AGI