Jessica Rumbelow comments on SolidGoldMagikarp (plus, prompt generation)

Jessica Rumbelow 5 Feb 2023 12:11 UTC
2 points
Yep. Aside from having to run forward prop n times to generate an output of length n, we can just optimise the mean probability of the target tokens at each position in the output; it's already implemented in the code. It does take way longer to find optimal completions, though.
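
To make the objective concrete, here is a rough sketch of "mean probability of the target tokens at each position". It is not the code from the post: `softmax`, `mean_target_probability`, `score_prompt` and the `next_token_logits` callable are all hypothetical names, and the model is assumed to be any function that maps a list of token ids to next-token logits. It only computes the score; the actual optimisation over the prompt is left out.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def mean_target_probability(logits_per_position, target_ids):
    """Average, over output positions, of the probability given to the target token."""
    if not target_ids or len(logits_per_position) != len(target_ids):
        raise ValueError("need one non-empty logit vector per target token")
    probs = [softmax(logits)[t] for logits, t in zip(logits_per_position, target_ids)]
    return sum(probs) / len(probs)


def score_prompt(next_token_logits, prompt_ids, target_ids):
    """Run one forward pass per target token (n passes for an output of length n).

    `next_token_logits` is a hypothetical stand-in for the model: it takes the
    token ids seen so far and returns logits over the vocabulary.
    """
    context = list(prompt_ids)
    logits_per_position = []
    for t in target_ids:
        logits_per_position.append(next_token_logits(context))
        context.append(t)  # feed the target token back in, not the model's own sample
    return mean_target_probability(logits_per_position, target_ids)
```

A prompt search would then try to maximise `score_prompt` for a fixed `target_ids`. The n forward passes per evaluation are one plausible reason longer completions are slower to optimise.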