pauses training to do alignment work
There’s yet another approach: conditional training, where the LLM is aligned during the pretraining phase. See How to Control an LLM’s Behavior for more details.
There’s yet another approach: conditional training, where the LLM is aligned during the pretraining phase. See How to Control an LLM’s Behavior for more details.