DanielFilan comments on Visible loss landscape basins don’t correspond to distinct algorithms

DanielFilan 28 Jul 2023 19:17 UTC
2 points
0
The second paper is just about linear connectivity, and does seem to suggest that linearly connected models run similar algorithms. But I guess I don’t expect neural net training to go in straight lines? (Altho I suppose momentum helps with this?)