Did you explain to GPT-4 what temperature is? GPT-4, especially before November, knew very little about LLMs due to training data cut-offs (e.g. the pre-November GPT-4 didn’t even know that the acronym “LLM” stood for “Large Language Model”).
Either way, it’s interesting that there is a signal. This feels similar in spirit to the self-recognition tasks in SAD (since in both cases the model has to pick up on subtle cues in the text to make some inference about the AI that generated it).
I didn’t explain it, but from playing with it I had the impression that it did understand what “temperature” was reasonably well. For example, gpt-4-0613, which is the checkpoint I tested, answers the question “What is ‘temperature’, in context of large language models?” with: “In the context of large language models like GPT-3, ‘temperature’ refers to a parameter that controls the randomness of the model's responses. A higher temperature (e.g., 0.8) would make the output more random, whereas a lower temperature (e.g., 0.2) makes the output more focused and deterministic. [...]”
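(For concreteness, a minimal sketch of how such a check could be run against that checkpoint with the standard OpenAI Python client; this is an illustration, not necessarily the exact call I used.)

```python
# Sketch: asking the gpt-4-0613 checkpoint what "temperature" means,
# using the OpenAI chat completions API. Assumes OPENAI_API_KEY is set.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4-0613",
    messages=[
        {
            "role": "user",
            "content": 'What is "temperature", in context of large language models?',
        }
    ],
    temperature=0,  # low temperature for a more deterministic knowledge check
)

print(response.choices[0].message.content)
```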
Another thing I wanted to do was compare GPT-4's performance to people's performance on this task, but I never got around to doing it.