Jessica Rumbelow comments on SolidGoldMagikarp (plus, prompt generation)

Jessica Rumbelow 5 Feb 2023 21:06 UTC
4 points
Interesting! Can you give a bit more detail or share code?
- neverix 6 Feb 2023 15:57 UTC
  7 points
  It is based on this. I changed it to optimize through a softmax over the token logits instead of using straight-through estimation, and added regularization on the embedded tokens.
  Notebook link: this version mimics this post, rather than optimizing a single neuron as the original does.
  EDIT: github link
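
For readers unfamiliar with the distinction, here is a minimal sketch of the forward pass in each approach. It is not the code from the notebook: the toy embedding table, the function names, and the choice of an entropy penalty as the regularizer are all assumptions made for illustration, since the comment does not say which regularization was used. With a softmax relaxation the model sees a probability-weighted mixture of token embeddings; with straight-through estimation it sees the single argmax token's embedding, and only the backward pass (not shown here) goes through the softmax.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def soft_embedding(logits, embedding_table):
    """Softmax relaxation: probability-weighted mixture of token embeddings."""
    weights = softmax(logits)
    dim = len(embedding_table[0])
    return [
        sum(w * row[d] for w, row in zip(weights, embedding_table))
        for d in range(dim)
    ]


def hard_embedding(logits, embedding_table):
    """Forward pass of a straight-through estimator: the argmax token's embedding.

    In a real straight-through setup the backward pass would reuse the
    softmax gradient; this sketch only shows the forward values.
    """
    best = max(range(len(logits)), key=lambda i: logits[i])
    return list(embedding_table[best])


def entropy(probs):
    """Shannon entropy in nats; used here as a hypothetical regularizer.

    Penalizing entropy pushes the mixture toward a single real token.
    This is an assumed stand-in, not the regularizer from the notebook.
    """
    return -sum(p * math.log(p) for p in probs if p > 0.0)


# Toy vocabulary of three tokens with 2-dimensional embeddings (made up).
EMBEDDINGS = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
logits = [2.0, 0.5, -1.0]  # learnable parameters for one prompt position

probs = softmax(logits)
soft = soft_embedding(logits, EMBEDDINGS)
hard = hard_embedding(logits, EMBEDDINGS)
penalty = entropy(probs)

print("token probabilities:", probs)
print("soft (mixture) embedding:", soft)
print("hard (argmax) embedding:", hard)
print("entropy penalty:", penalty)
```

In an actual optimization loop the logits would be updated by gradient descent on the model's loss plus a weighted penalty term, using an autodiff framework; the trade-off is that the soft mixture gives smooth gradients but may not correspond to any real token unless something pulls it toward one.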