If system prompts aren’t enough but fine-tuning is, this should be doable with different adapters that can be loaded at inference time, without needing to distill into separate models.
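The adapter idea above can be sketched with a toy LoRA-style layer: a frozen base weight matrix plus named low-rank deltas that are swapped per request at inference time. This is an illustrative sketch in plain Python, not any real serving stack's API; all class and method names here are hypothetical.

```python
# Toy sketch of inference-time adapter swapping: a frozen base weight
# matrix plus per-persona low-rank (LoRA-style) deltas, selected per
# request instead of distilling into separate models.
# Names (AdapterLayer, set_adapter, etc.) are illustrative only.

def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(r_i * v_i for r_i, v_i in zip(row, v)) for row in m]

def vadd(a, b):
    """Element-wise vector addition."""
    return [x + y for x, y in zip(a, b)]

class AdapterLayer:
    def __init__(self, base_w):
        self.base_w = base_w      # frozen base weights
        self.adapters = {}        # name -> (A, B) low-rank factors
        self.active = None

    def add_adapter(self, name, a, b):
        self.adapters[name] = (a, b)

    def set_adapter(self, name):
        self.active = name        # hot-swap at inference time, no retraining

    def forward(self, x):
        y = matvec(self.base_w, x)
        if self.active is not None:
            a, b = self.adapters[self.active]
            # LoRA-style update: y += B @ (A @ x)
            y = vadd(y, matvec(b, matvec(a, x)))
        return y

layer = AdapterLayer([[1.0, 0.0], [0.0, 1.0]])                # identity base
layer.add_adapter("persona_a", [[1.0, 0.0]], [[0.0], [1.0]])  # rank-1 delta
layer.set_adapter("persona_a")
print(layer.forward([2.0, 3.0]))  # base output plus the adapter's delta
```

In a real deployment the same pattern applies at scale: the large base model stays resident in memory, and only the small adapter matrices differ per "flavor", which is why swapping them is cheap compared to hosting separate distilled models.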
Yes, I agree that’s an alternative. Then you’d need the primary model to be less heavily RLHF’d and less narrowly focused. A more raw model should be capable, with an adapter, of expressing a wider variety of behaviors.
I still think that distilling down from specialized large teacher models would likely give the best result, but that’s just a hunch.
For raw IQ, sure. I just mean “conversational flavor”.