Adam Pearce
Karma: 27
The optimization section of Learning Transformer Programs might work with your task/model.
You’ve probably seen David Ha’s work, but something like https://es-clip.github.io/ could be a good starting point for dropping backprop.
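For a sense of what dropping backprop can look like in practice, here is a minimal sketch of a basic evolution strategy in plain Python. It is not the method used by the linked project, only a toy illustration of the general idea: perturb the parameters with Gaussian noise, score each perturbation, and step against the score-weighted average. The names `evolve` and `toy_loss`, and all hyperparameter values, are hypothetical choices made for this example.

```python
import random


def evolve(loss, theta, sigma=0.1, lr=0.05, population=50, steps=200, seed=0):
    """Minimise `loss` with a basic evolution strategy; no gradients of `loss` are used."""
    rng = random.Random(seed)
    theta = list(theta)
    for _ in range(steps):
        # Sample Gaussian perturbations and score each perturbed parameter vector.
        noises = [[rng.gauss(0.0, 1.0) for _ in theta] for _ in range(population)]
        scores = [loss([t + sigma * n for t, n in zip(theta, noise)]) for noise in noises]
        # Standardise scores so the step size does not depend on the loss scale.
        mean = sum(scores) / population
        std = (sum((s - mean) ** 2 for s in scores) / population) ** 0.5 or 1.0
        # Move against the score-weighted average of the perturbations.
        for i in range(len(theta)):
            grad_est = sum(
                (s - mean) / std * noise[i] for s, noise in zip(scores, noises)
            ) / population
            theta[i] -= lr * grad_est
    return theta


def toy_loss(params):
    # Hypothetical stand-in objective with its minimum at (3, -2).
    return (params[0] - 3.0) ** 2 + (params[1] + 2.0) ** 2


if __name__ == "__main__":
    print(evolve(toy_loss, [0.0, 0.0]))
```

Because `loss` is only ever called, never differentiated, it could just as well be a non-differentiable score, which is the appeal of this family of methods for models with discrete or exotic components.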
The exotic activation function almost feels like cheating? Like I want the model to discover these useful structures, then try to understand them. But trying to do everything at once may be too hard.
Incredibly minor, but changing from onchange to oninput and dropping the animation will make the slider feel much slicker.
- Aug 7, 2023, 10:57 PM; 8 points) 's comment on Growing Bonsai Networks with RNNs by (
Lots of custom d3 https://github.com/PAIR-code/ai-explorables/tree/master/source/grokking