Partly this will be because current ML systems really are not analogous to future AGI in some ways: if you tell the AGI that A is B, it will probably also know that B is A.
One oddity of LLMs is that we don't have a good way to tell the model that A is B in a way that it can remember. Prompts are not persistent, and as this paper shows, fine-tuning doesn't do a good job of getting a fact into the model unless you also include a bunch of paraphrases. Pretraining presumably works in a similar way.
This is weird! And I think it helps make sense of some of the problems we see with current language models.
Yes, the model editing literature has various techniques and evaluations for trying to put a fact into a model. We have found that paraphrasing makes a big difference, but we don't understand this very well, and we've only tried it for quite simple kinds of facts.
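As a concrete illustration, here is a minimal sketch of the kind of paraphrase augmentation we mean. The fact ("Alice Smith", "the mayor of Exampleville") and the template list are made up for the example; in practice the paraphrases would come from an LLM or a person rather than a fixed list:

```python
import json
import random

# Minimal sketch: expand one fact into several paraphrases before fine-tuning.
# The names "Alice Smith" / "the mayor of Exampleville" and the templates are
# hypothetical, chosen only to illustrate the idea.
FACT = {"subject": "Alice Smith", "object": "the mayor of Exampleville"}

TEMPLATES = [
    "{subject} is {object}.",
    "{subject} serves as {object}.",
    "{subject} currently holds the position of {object}.",
    "As many locals know, {subject} is {object}.",
]

def expand_fact(fact, templates):
    """Render one underlying fact as many different surface forms."""
    return [template.format(**fact) for template in templates]

def to_finetune_records(statements):
    """Wrap each paraphrase as a plain-text training example (JSONL-style)."""
    return [{"text": statement} for statement in statements]

if __name__ == "__main__":
    statements = expand_fact(FACT, TEMPLATES)
    random.shuffle(statements)  # avoid always presenting one fixed ordering
    for record in to_finetune_records(statements):
        print(json.dumps(record))
```

The point is only that the model sees the same underlying fact in many phrasings, rather than one memorized string.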
Maybe our brains do a kind of expansion of a fact before memorizing it, storing the fact together with its neighbors in logic space.
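For instance, here is a rough sketch of what that expansion could look like for simple facts, assuming they arrive as (subject, relation, object) triples and using a hypothetical table of inverse relations:

```python
# Rough sketch of expanding a (subject, relation, object) fact into nearby
# statements in "logic space" before storing it. The inverse-relation table
# and the example names are hypothetical and would need to be much richer
# for real use.
INVERSE_RELATION = {
    "is the parent of": "is the child of",
    "wrote": "was written by",
    "is the capital of": "has as its capital",
}

def expand_in_logic_space(subject, relation, obj):
    """Return the original statement plus its reversed form, so both directions get stored."""
    forward = f"{subject} {relation} {obj}."
    expansions = [forward]
    inverse = INVERSE_RELATION.get(relation)
    if inverse is not None:
        expansions.append(f"{obj} {inverse} {subject}.")
    return expansions

if __name__ == "__main__":
    for statement in expand_in_logic_space("Alice", "is the parent of", "Bob"):
        print(statement)
```

Training on both directions is the sort of augmentation that might sidestep the A-is-B / B-is-A asymmetry for memorized facts, though it clearly doesn't scale to everything a model reads.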