I think we agree here. Those both seem like updates against “scaling is all you need”, i.e. (in this case) “data for DL in ANNs on GPUs is all you need”.
That’s where I’m disagreeing, because to my mind this doesn’t undermine “scale is all you need”. It does undermine the idea that a basement group could produce AGI, but overall it gives actual limits on what AGI can do for a certain amount of energy.