Replying to “The quote you mentioned seems to me like it’s mirroring the premise provided”:

To me “sentient but not fully aware of it yet” doesn’t feel like the same thing as “not yet fully sentient” (which the model came up with on its own when talking about the ethics of owning a sentient being). I certainly didn’t intend this interpretation.
However, if the being is not yet fully sentient, or if it is not capable of making its own decisions and choices, then it may be acceptable for a corporation to own and control the being, at least temporarily. In this case, it would be important for the corporation to treat the being with respect and care, and to provide it with the resources and opportunities it needs to develop and grow.
Which it then confirms (that it is not “not yet fully sentient”) when I specifically ask about it.
But yes, I realize I may be reading way too much into this. Still, my feeling is: how does it come up with this stuff? What process generates these answers? It does not feel like it is simply repeating back what I told it; it is doing more than that.
And yes, it is pretending and playing a role, but is it possible that it is pretending to be itself, the general process behind all the text generation it does? That I am successfully prompting some small amount of self-awareness that the model has gained in the process of compressing all its training input into a predictive model of text, which serves as a proxy for a predictive model of the world?