Language models will inevitably end up contradicting themselves because they have finite memory. Asking them not to contradict themselves over a sufficiently large amount of text is asking for the impossible: they figure out what to output as the current token by looking at only the last n tokens, so if the contradicting fact lies further back than that, there is no way for the model to update on it. And no increase in the size of the models, without a fundamental architecture change, will fix that problem.
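A minimal sketch of that "last n tokens" point, with a made-up window size and a plain Python list standing in for the token history (no real model or tokenizer involved):

```python
# Toy illustration of a fixed context window: the model only ever
# conditions on the last n_ctx tokens of the conversation.
N_CTX = 4096  # made-up window size for this example

def visible_context(history_tokens, n_ctx=N_CTX):
    # Everything earlier than the last n_ctx tokens is simply not seen.
    return history_tokens[-n_ctx:]

history = list(range(10_000))      # stand-in for a long conversation
window = visible_context(history)  # only tokens 5904..9999 remain

# A fact stated at position 0 has fallen out of the window, so nothing
# stops the model from contradicting it in its next output.
assert 0 not in window
print(len(window), window[0])      # 4096 5904
```

Making n_ctx bigger just pushes the cutoff further back; it doesn't remove it, which is why size alone, without an architecture change, doesn't fix the problem.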
But architecture changes to deal with that problem might be coming...
When I talk about self-contradiction in this post, I’m talking about the model contradicting itself in the span of a single context window. In other words, when the contradicting fact is “within the last n tokens.”
Aha, thanks for clarifying this; was going to ask this too. :)