Hugging Face has a nice guide that covers the popular approaches to generation circa 2020. I recently read about tail-free sampling as well. I’m sure other techniques have been developed since then, though I’m not immersed enough in the NLP state of the art to be aware of them.
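(For reference, a minimal sketch of the decoding methods that guide covers, using the Hugging Face `transformers` `generate()` API; the `gpt2` checkpoint and the specific hyperparameter values here are purely illustrative, not recommendations:)

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tokenizer("The meaning of life is", return_tensors="pt")

# Greedy decoding: always take the single highest-probability next token.
greedy = model.generate(**inputs, max_new_tokens=40, do_sample=False)

# Stochastic sampling with top-k / top-p (nucleus) truncation, as in the guide.
sampled = model.generate(
    **inputs,
    max_new_tokens=40,
    do_sample=True,
    top_k=50,        # sample only from the 50 most likely tokens...
    top_p=0.95,      # ...further restricted to the smallest set covering 95% of the probability mass
    temperature=0.8, # flatten/sharpen the distribution before sampling
)

print(tokenizer.decode(sampled[0], skip_special_tokens=True))
```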
If you’re curious, the most interesting pure stochastic sampling variant I’ve seen lately is “Contrastive Search Is What You Need For Neural Text Generation”, Su & Collier 2022. (Unfortunately, it is only benchmarked on very small models, and AFAIK no one has generated samples from large GPT-3-scale models or provided a quantitative/qualitative description.)
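(A minimal sketch of running contrastive search through the `transformers` `generate()` API, which supports it via `penalty_alpha` in recent versions; the checkpoint, prompt, and the `penalty_alpha`/`top_k` values are illustrative only, loosely following the settings reported in the paper:)

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2-large")
model = AutoModelForCausalLM.from_pretrained("gpt2-large")

inputs = tokenizer("DeepMind Company is", return_tensors="pt")

# Setting penalty_alpha > 0 together with a small top_k triggers contrastive
# search: candidates are drawn from the top-k tokens, then re-ranked by a
# degeneration penalty that discourages tokens whose representations are too
# similar to the existing context.
out = model.generate(
    **inputs,
    max_new_tokens=64,
    penalty_alpha=0.6,
    top_k=4,
)

print(tokenizer.decode(out[0], skip_special_tokens=True))
```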
Thanks! I had actually skimmed this recently but forgot to add it to my reading list. The cherry-picked examples for text generation seem a bit low-information, but it would be interesting to see their technique applied to a larger model.
Thanks, I added a parenthetical sentence to indicate this possibility.