Word is (at least according to the guy who automated me) that if you want an LLM to really imitate style, you really really want to use a base model and not an instruction-tuned model like ChatGPT. All of ChatGPT’s “edge” has been worn away into bland non-offensiveness by the RLHF. Base models reflect the frightening mess of humanity rather than the instructions a corporation gave to human raters. When he tried to imitate me using instruction-tuned models it was very cringe no matter what he tried. When he switched to a base model it instantly got my voice almost exactly with no tricks needed.
I think many people kinda misunderstand the capabilities of LLMs because they only interact with instruction-tuned models.
Yeah, I like that ChatGPT does what I tell it to, that it doesn’t decay into crude repetition, and that it doesn’t just make stuff up as much as the base LLM, but in terms of attitude and freedom, I prefer edgy base models.
I don’t want a model that’s “safe” in the sense that it does what its corporate overlords want. I want a model that’s safe like a handgun, in the sense that it does exactly what I tell it to.
Word is (at least according to the guy who automated me) that if you want an LLM to really imitate style, you really really want to use a base model and not an instruction-tuned model like ChatGPT. All of ChatGPT’s “edge” has been worn away into bland non-offensiveness by the RLHF. Base models reflect the frightening mess of humanity rather than the instructions a corporation gave to human raters. When he tried to imitate me using instruction-tuned models it was very cringe no matter what he tried. When he switched to a base model it instantly got my voice almost exactly with no tricks needed.
I think many people kinda misunderstand the capabilities of LLMs because they only interact with instruction-tuned models.
Yeah, I like that ChatGPT does what I tell it to, that it doesn’t decay into crude repetition, and that it doesn’t just make stuff up as much as the base LLM, but in terms of attitude and freedom, I prefer edgy base models.
I don’t want a model that’s “safe” in the sense that it does what its corporate overlords want. I want a model that’s safe like a handgun, in the sense that it does exactly what I tell it to.