I think this argument is made even stronger by additional similar considerations for input tokens too—given the even lower price of input tokens (compared to output tokens), and the scaling laws for long context windows and for RAG.
I think this argument is made even stronger by additional similar considerations for input tokens too—given the even lower price of input tokens (compared to output tokens), and the scaling laws for long context windows and for RAG.