lberglund comments on Paper: LLMs trained on “A is B” fail to learn “B is A”

lberglund 15 Nov 2023 19:16 UTC
4 points
2
We actually do train on both the prompt and completion. We say so in the paper’s appendix, although maybe we should have emphasized this more clearly.
- Daniel Paleka 15 Nov 2023 19:41 UTC
  2 points
  1
  Oh, so you have prompt_loss_weight=1, got it. I'll cross out my original comment. I am now not sure what the difference is between training on {"prompt": A, "completion": B} and training on {"prompt": "", "completion": AB}, or why the post emphasizes that so much.
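
One way to make the question concrete is a minimal sketch, assuming the fine-tuning loss is a per-token negative log-likelihood in which prompt tokens are scaled by `prompt_loss_weight` and completion tokens have weight 1. The function `weighted_nll` and the log-probabilities below are hypothetical, for illustration only, and any separator or end-of-text tokens that a real fine-tuning pipeline might insert are ignored. Under these assumptions, the two formats give the same loss when the weight is 1 and differ only when it is below 1.

```python
import math


def weighted_nll(token_logprobs, prompt_len, prompt_loss_weight):
    """Sum of per-token negative log-likelihoods.

    The first `prompt_len` tokens are treated as the prompt and scaled by
    `prompt_loss_weight`; the remaining tokens are the completion (weight 1).
    This is an assumed loss form, not a confirmed implementation.
    """
    total = 0.0
    for i, logprob in enumerate(token_logprobs):
        weight = prompt_loss_weight if i < prompt_len else 1.0
        total += -weight * logprob
    return total


# Hypothetical model log-probabilities for a 5-token sequence "AB":
# A is the first 2 tokens, B is the last 3.
logprobs = [math.log(p) for p in (0.5, 0.25, 0.8, 0.1, 0.4)]

# {"prompt": A, "completion": B} with prompt_loss_weight=1
split_full = weighted_nll(logprobs, prompt_len=2, prompt_loss_weight=1.0)

# {"prompt": "", "completion": AB}: every token is a completion token
empty_prompt = weighted_nll(logprobs, prompt_len=0, prompt_loss_weight=1.0)

# {"prompt": A, "completion": B} with the prompt fully masked out
split_masked = weighted_nll(logprobs, prompt_len=2, prompt_loss_weight=0.0)

print(split_full, empty_prompt, split_masked)
```

In this sketch the first two values match, so the split only matters once the prompt tokens are down-weighted or masked.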