GPT-4o both has a new tokenizer and was trained directly on audio (whereas my understanding is that GPT-4 was trained only on text and images). Is there precedent for upgrading a model to a new tokenizer? It seems like it’s probably better to think of it as an entirely new model. If that’s the case, what actually makes it a GPT-4?
[Question] How is GPT-4o Related to GPT-4?
GPT-4o both has a new tokenizer and was trained directly on audio (whereas my understanding is that GPT-4 was trained only on text and images). Is there precedent for upgrading a model to a new tokenizer? It seems like it’s probably better to think of it as an entirely new model. If that’s the case, what actually makes it a GPT-4?