the fact that the model emits sentences in the grammatical first person doesn’t seem like reliable evidence that it “really knows” it’s talking about “itself”
I consider situational awareness to be more about being aware of one’s situation, and how various interventions would affect it. Furthermore, the main evidence I meant to present was “ChatGPT 3.5 correctly responds to detailed questions about interventions on its situation and future operation.” I think that’s substantial evidence of (certain kinds of) situational awareness.