In general a language model will ‘know’ the sentence related to the single occurrence of a rare name. I don’t think you learn much here if there are enough parameters available to support this memory.
In general a language model will ‘know’ the sentence related to the single occurrence of a rare name. I don’t think you learn much here if there are enough parameters available to support this memory.