Roger Dearnaley comments on Agentized LLMs will change the alignment landscape

Roger Dearnaley 22 Apr 2023 4:14 UTC
3 points
2
It’s also quite likely that something like Auto-GPT would work a lot better using a version of the LLM that had been fine-tuned/reinforcement-trained for this specific use case, just as ChatGPT is a lot more effective as a chatbot than the underlying GPT-3 model was before the specialized training. If the LLM is optimized for the wrapper, and the wrapper is designed to make efficient use of the LLM’s entire context size, things are going to work a lot better.
- RogerDearnaley 5 Dec 2023 9:09 UTC
  1 point
  0
  7 months later, we now know that this is true. Also, we now know that you can take output from a prompted/scaffolded LLM and use it to fine-tune another LLM to do the same things without needing the prompt/scaffold.
  - RohanS 25 Jul 2024 8:40 UTC
    2 points
    0
    Could you please point out the work you have in mind here?
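
For readers unfamiliar with the idea in the 5 Dec 2023 comment above (fine-tuning one model on outputs collected from a prompted/scaffolded model), here is a minimal sketch of the data-preparation step only. It is an illustration under stated assumptions, not a pointer to any specific published work: `SCAFFOLD`, `scaffolded_teacher`, `build_distillation_records` and `to_jsonl` are all hypothetical names, and the teacher is a stand-in function where a real model call would go.

```python
import json

# Hypothetical scaffold: a fixed instruction prefix wrapped around each task.
SCAFFOLD = "You are an agent. Think step by step, then answer.\nTask: {task}\n"


def scaffolded_teacher(prompt: str) -> str:
    """Stand-in for a call to a prompted/scaffolded LLM.

    Assumption: a real model call would go here. This stub only echoes
    the task so the example runs without any external dependency.
    """
    task = prompt.split("Task: ", 1)[1].strip()
    return f"Answer for: {task}"


def build_distillation_records(tasks):
    """Pair each bare task with the output produced WITH the scaffold.

    A student model fine-tuned on (bare task -> teacher output) pairs is
    trained to reproduce the scaffolded behavior without seeing the scaffold.
    """
    records = []
    for task in tasks:
        teacher_output = scaffolded_teacher(SCAFFOLD.format(task=task))
        # The scaffold text is deliberately left out of the stored prompt.
        records.append({"prompt": task, "completion": teacher_output})
    return records


def to_jsonl(records) -> str:
    """Serialize records as JSON Lines, one training example per line."""
    return "\n".join(json.dumps(r) for r in records)


records = build_distillation_records(["sort a list", "summarize a file"])
print(to_jsonl(records))
```

The key design choice is that the stored `prompt` is the bare task rather than the scaffolded prompt; the actual fine-tuning step, and whether it recovers the scaffolded behavior, depends on the models and training setup and is not shown here.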