Arthur Conmy comments on Some open-source dictionaries and dictionary learning infrastructure

Arthur Conmy 7 Feb 2024 22:26 UTC
1 point
0
Do you apply LR warmup immediately after doing resampling (i.e. immediately reducing the LR, and then slowly increasing it back to the normal value)? In my GELU-1L blog post I found this pretty helpful (in addition to doing LR warmup at the start of training)
- Sam Marks 7 Feb 2024 22:33 UTC
  3 points
  1
  Parent
  At the time that I made this post, no, but this has been implemented in dictionary_learning since I saw your suggestion to do so in your linked post.