TurnTrout comments on TurnTrout’s shortform feed

TurnTrout 5 Feb 2024 19:37 UTC
LW: 8 AF: 5
6
AF
the fact that the model emits sentences in the grammatical first person doesn’t seem like reliable evidence that it “really knows” it’s talking about “itself”
I consider situational awareness to be more about being aware of one’s situation, and how various interventions would affect it. Furthermore, the main evidence I meant to present was “ChatGPT 3.5 correctly responds to detailed questions about interventions on its situation and future operation.” I think that’s substantial evidence of (certain kinds of) situation awareness.