neverix comments on SolidGoldMagikarp (plus, prompt generation)

neverix 6 Feb 2023 15:57 UTC
7 points
0
It is based on this. I changed it to optimize using softmax instead of straight-through estimation and added regularization for the embedded tokens.
Notebook link—this is a version that mimics this post instead of optimizing a single neuron as in the original.
EDIT: github link