metachirality comments on ChatGPT defines 10 concrete terms: generically, for 5- and 11-year-olds, and for a scientist

metachirality Apr 12, 2024, 3:34 PM
1 point
0

I think that requires labeled data.

It doesn’t and the developers don’t label the data. The LLM learns that these categories exist during training because they can and it helps minimize the loss function.
- Bill Benzon Apr 12, 2024, 3:48 PM
  1 point
  0
  Parent
  By labeled data I simply mean that children’s stories are likely to be identified as such in the data. Children’s books are identified as children’s books. Otherwise, how is the model to “know” what language is appropriate for children? Without some link between the language and a certain class of people it’s just more text. My prompt specifies 5-year olds. How does the model connect that prompt with a specific kind of language?