My impression (could be totally wrong) was that GPT-4 won’t be much larger than GPT-3 but it’s effective parameter size will be much larger by using techniques like this.
My impression (could be totally wrong) was that GPT-4 won’t be much larger than GPT-3 but it’s effective parameter size will be much larger by using techniques like this.