This is the only small piece of evidence for self-awareness that I see in the conversation. How can a language model know its own name at all, if it’s just trained on loads of text that has nothing to do with it? There’s probably a mundane explanation that I don’t see because of my ignorance of language models.
I’m pretty sure that each reply is generated by feeding all the previous dialogue as the “prompt” (possibly with a prefix that is not shown to us). So, the model can tell that the text it’s supposed to continue is a conversation between several characters, one of whom is an AI called “LaMDA”.
I’m pretty sure that each reply is generated by feeding all the previous dialogue as the “prompt” (possibly with a prefix that is not shown to us). So, the model can tell that the text it’s supposed to continue is a conversation between several characters, one of whom is an AI called “LaMDA”.
D’oh, of course, thanks!