Even if it’s the same cost to train, wouldn’t it still be a win if inference is a significant part of your compute budget?
Even if it’s the same cost to train, wouldn’t it still be a win if inference is a significant part of your compute budget?