So, after reading the KAN paper and thinking about it in the context of this post: https://www.lesswrong.com/posts/gTZ2SxesbHckJ3CkF/transformers-represent-belief-state-geometry-in-their
My vague intuition is that running the same experiment with a KAN in place of the transformer would produce a clearer fractal, one that wiggled less once training loss had plateaued. Is that also other people's intuition?
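For concreteness: if I'm remembering the linked post's setup right, the fractal there comes from fitting a linear (affine) probe from the model's internal activations to the ground-truth belief states of the generating hidden Markov process, then looking at the geometry of the projected points. A minimal sketch of just that probe step, where `activations` and `belief_states` are placeholder arrays standing in for whatever a transformer or KAN run would actually produce:

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Placeholder data -- in the real experiment these would be:
#   activations:   hidden activations at each token position, shape (n_samples, d_model)
#   belief_states: the HMM's posterior over hidden states at those positions, shape (n_samples, n_states)
rng = np.random.default_rng(0)
activations = rng.normal(size=(10_000, 64))
belief_states = rng.dirichlet(np.ones(3), size=10_000)

# Fit an affine map from activations onto belief states and project.
probe = LinearRegression().fit(activations, belief_states)
projected = probe.predict(activations)

# The "fractal" claim is about the geometry of `projected` (points on or near
# the belief simplex). My guess is that with a KAN the projected cloud would
# sit tighter on that structure and drift less after the loss plateaus.
print("probe R^2:", probe.score(activations, belief_states))
```

In those terms, "clearer and wiggling less" would cash out as the projected point cloud hugging the ideal belief-state geometry more tightly, and moving around less between late-training checkpoints.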