Does it itself generate the query, or is it a separate trained system? I was a bit confused about this in the paper.

It does generate the query itself, though:

"A search query generator: an encoder-decoder Transformer that takes in the dialogue context as input, and generates a search query. This is given to the black-box search engine API, and N documents are returned."

You'd think they'd train the same model weights and just make it multi-task with the appropriate prompting, but no, that phrasing implies that it's a separate finetuned model, to the extent that that matters. (I don't particularly think it does matter: whether it's one model or multiple, the system as a whole still has most of the same behaviors and feedback loops once it gets more access to data or starts being trained on previous dialogues/sessions. How many systems are in your system? Probably a lot, depending on your level of analysis. Nevertheless...)
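The quoted pipeline (dialogue context → query generator → black-box search API → N documents) can be sketched as below. Both `generate_query` and `search_api` here are stand-in stubs of my own, not the paper's actual components; the real query generator is a seq2seq Transformer and the real search engine is an external service:

```python
from typing import List

def generate_query(dialogue_context: List[str]) -> str:
    """Stand-in for the encoder-decoder Transformer: maps the full
    dialogue context to a single search-query string. (A real system
    would run a finetuned seq2seq model here, not a heuristic.)"""
    # Trivial stub: treat the most recent turn as the query.
    return dialogue_context[-1]

def search_api(query: str, n: int) -> List[str]:
    """Stand-in for the black-box search engine API: returns N documents
    for the query. Opaque to the rest of the system."""
    return [f"document {i} for query: {query!r}" for i in range(n)]

def retrieve(dialogue_context: List[str], n: int = 5) -> List[str]:
    # 1. Generate a search query from the dialogue context.
    query = generate_query(dialogue_context)
    # 2. Hand it to the search engine; N documents come back.
    return search_api(query, n)

docs = retrieve(["Hi!", "Who wrote Moby-Dick?"], n=3)
```

Note that nothing in this interface tells you whether `generate_query` shares weights with the main dialogue model or is a separately finetuned one, which is exactly the ambiguity at issue.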