Gunnar_Zarncke comments on KAN: Kolmogorov-Arnold Networks

Gunnar_Zarncke 2 May 2024 10:57 UTC
4 points
−2
MLP or KAN doesn’t make much difference for the GPUs as it is lots of matrix multiplications anyway. It might make some difference in how the data is routed to all the GPU cores as the structure (width, depth) of the matrixes might be different, but I don’t know the details of that.