I think you’re mostly right about the world, but I’m going to continue to articulate disagreements based on my sense of dissatisfaction. You should probably mostly read me as speaking from a should-world rather than being very confused about the is-world.
The bitter lesson is insufficient to explain the lack of structure we’re seeing. I gave the example of Whisper. I haven’t actually used Whisper, so correct me if I’m wrong—maybe there is a way to get more nuanced probabilistic information out of it? But the bitter lesson is about methods of achieving capabilities, not about the capabilities themselves. Producing a plain text output rather than a more richly annotated text that describes some of the uncertainty is a design choice.
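My rough understanding (unverified, per the caveat above) is that the open-source whisper package does retain some of this internally: the transcription result appears to carry per-segment average log-probabilities and, with word timestamps enabled, per-word probabilities, even though the headline output is plain text. If that's right, it only underlines that surfacing the uncertainty is a design choice. A hedged sketch of what reading it out might look like:

```python
# Unverified sketch based on the openai-whisper docs; field names may differ
# across versions.
import whisper

model = whisper.load_model("base")
result = model.transcribe("speech.mp3", word_timestamps=True)

for segment in result["segments"]:
    # avg_logprob: mean token log-probability over the segment
    # no_speech_prob: model's probability that the segment isn't speech at all
    print(f"{segment['text']!r}  "
          f"avg_logprob={segment['avg_logprob']:.2f}  "
          f"no_speech_prob={segment['no_speech_prob']:.2f}")
    for word in segment.get("words", []):
        # per-word confidence that the plain-text interface throws away
        print(f"  {word['word']!r}: p={word['probability']:.2f}")
```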
To give another example, LLMs could learn a conditional model that’s annotated with metadata like author, date and time, etc. Google’s LaMDA reportedly had something like author-vectors, so that different “characters” could be easily summoned. I would love to play with that, e.g., averaging the author-vectors of specific authors I’m interested in to see what it’s like to mix their voices. In theory, LLMs could also learn to predict the metadata. You could use partially-labeled-data approaches such as the EM algorithm to train LLMs to predict the metadata labels while also training them to use those labels in text generation. This would give a rich set of useful capabilities, and it would be scientifically useful too: predictions of date, author, and other metadata would be interesting to historians.
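To make the EM idea concrete, here is a minimal hard-EM sketch; everything named here (`predict_metadata`, `conditional_nll`, `metadata_nll`, the optimizer interface) is a hypothetical placeholder, and a real version would propagate soft posteriors over the labels rather than hard guesses:

```python
# Hypothetical sketch of hard EM over missing metadata labels. The model is
# assumed to expose two heads: one that predicts metadata from text, and one
# that scores text conditioned on metadata.

def e_step(lm, unlabeled_texts):
    """Impute missing metadata with the model's current best guesses."""
    return [(lm.predict_metadata(text), text) for text in unlabeled_texts]

def m_step(lm, labeled_pairs, imputed_pairs, optimizer):
    """Fit the conditional LM and the metadata predictor on all pairs."""
    for metadata, text in labeled_pairs + imputed_pairs:
        loss = (
            lm.conditional_nll(text, metadata)   # -log p(text | author, date, ...)
            + lm.metadata_nll(metadata, text)    # -log p(author, date, ... | text)
        )
        optimizer.step(loss)

def train(lm, labeled_pairs, unlabeled_texts, optimizer, rounds=10):
    """Alternate imputing labels (E) and refitting the model (M)."""
    for _ in range(rounds):
        imputed = e_step(lm, unlabeled_texts)
        m_step(lm, labeled_pairs, imputed, optimizer)
```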
In this way, we could actually get closer to a “truth machine”. There’s a world of difference between the kinds of inferences you can make from ChatGPT’s opinion about a text vs the inferences you could make by treating author, date, and the rest as genuine latent variables and seeing what latent-variable-inference algorithms conclude.
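Concretely, the kind of inference I mean is just Bayes over the metadata, scored by the conditional model’s likelihoods rather than by asking a chatbot what it thinks (again, `conditional_loglik` is a placeholder for whatever scoring interface such a model exposed):

```python
import math

def author_posterior(lm, text, candidate_authors, log_prior):
    """p(author | text) ∝ p(text | author) p(author), via a conditional LM."""
    log_joint = {
        a: lm.conditional_loglik(text, metadata={"author": a}) + log_prior[a]
        for a in candidate_authors
    }
    # log-sum-exp for a numerically stable normalizer
    m = max(log_joint.values())
    log_norm = m + math.log(sum(math.exp(v - m) for v in log_joint.values()))
    return {a: math.exp(v - log_norm) for a, v in log_joint.items()}
```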
To give a third example, there’s Drexler’s Quasilinguistic Neural Representations.
You say “it’s a matter of demand”. So why does it feel so much like big tech is eager to push the chatbot model on everyone? Everyone is scared to fall behind in the AI race, ever since ChatGPT made it feel plausible that AI could capture significant market share. But did big tech guess correctly about what the demand was actually for? Or did everyone over-update on the specific form of ChatGPT, so that there’s just not much else out there to reveal what the demand really is? Little sparkle-emoji buttons decorate everything; press one to talk to the modern equivalent of Clippy.