habryka comments on Common misconceptions about OpenAI

habryka 1 Sep 2022 20:46 UTC
3 points
I did not know!

However, I don’t think this is really the same kind of reference class in terms of risk. It looks like search engine access in the Facebook case is much more limited: it basically consists of appending a number of relevant documents to the query, rather than the model itself being able to send various commands, such as starting new searches and clicking on links.
- gwern 1 Sep 2022 21:58 UTC
  5 points
  It does generate the query itself, though:
  
  “A search query generator: an encoder-decoder Transformer that takes in the dialogue context as input, and generates a search query. This is given to the black-box search engine API, and N documents are returned.”
  - habryka 2 Sep 2022 8:57 UTC
    3 points
    Does it itself generate the query, or is it a separate trained system? I was a bit confused about this in the paper.
    - gwern 2 Sep 2022 16:29 UTC
      6 points
      You’d think they’d train the same model weights and just make it multi-task with the appropriate prompting, but no, that phrasing implies that it’s a separate finetuned model, to the extent that that matters. (I don’t particularly think it does matter because whether it’s one model or multiple, the system as a whole still has most of the same behaviors and feedback loops once it gets more access to data or starts being trained on previous dialogues/sessions—how many systems are in your system? Probably a lot, depending on your level of analysis. Nevertheless...)