Speaking for myself, I would have confidently predicted the opposite result for the largest models.
My understanding is that LLMs build something like a world-model during training by compressing the training data into abstractions. I would have expected a fact like “Tom Cruise’s mother is Mary Lee Pfeiffer” to be represented in the model as an abstract association between the two names that could then be “decompressed” back into language in a lot of different ways.
The fact that it’s apparently represented in the model only as that exact phrase (or maybe as some kind of very alien abstraction?) leads me to think that either LLMs are a bit more like “stochastic parrots” than I would have expected, or their world-models are a lot more alien.
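For concreteness, here’s a rough sketch of one way to probe this asymmetry: compare the log-probability a model assigns to the fact stated in each direction. Everything below is my own illustration, not the original experiment’s setup; the prompts, the helper function, and the choice of GPT-2 (via Hugging Face transformers) are stand-ins, and GPT-2 is almost certainly too small to know this fact anyway. The point is the measurement, not the result.

```python
# A minimal sketch, assuming the Hugging Face transformers library.
# GPT-2 and these prompts are illustrative stand-ins, not the paper's setup.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def completion_logprob(prompt: str, completion: str) -> float:
    """Sum of log-probs the model assigns to `completion` following `prompt`.

    Assumes the tokenization of `prompt` is a prefix of the tokenization of
    `prompt + completion` (true here because `completion` starts with a space).
    """
    prompt_len = tok(prompt, return_tensors="pt").input_ids.shape[1]
    full_ids = tok(prompt + completion, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(full_ids).logits
    # Log-prob of each token conditioned on everything before it.
    log_probs = torch.log_softmax(logits[:, :-1], dim=-1)
    targets = full_ids[:, 1:]
    token_lp = log_probs.gather(2, targets.unsqueeze(-1)).squeeze(-1)
    # Keep only the completion tokens.
    return token_lp[0, prompt_len - 1:].sum().item()

# Forward direction: name the mother given the son.
fwd = completion_logprob("Tom Cruise's mother is", " Mary Lee Pfeiffer")
# Reverse direction: name the son given the mother.
rev = completion_logprob("Mary Lee Pfeiffer's son is", " Tom Cruise")
print(f"forward: {fwd:.2f} nats, reverse: {rev:.2f} nats")
```

If the compressed-association picture were right, the two directions should score at least roughly comparably; the reversal result suggests the forward direction wins by a wide margin.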
The largest models should be expected to compress less than smaller ones though, right?