Yeah. Philosophically, the takeaway for me would be something like this: conversation alone is not enough. We can only draw conclusions (is the entity manipulative, etc.) if we know at least some constraints under which the entity operates. For example, that it's a human similar to us, a language model with a finite horizon, or an AI correctly built to be friendly. If you don't know any such constraints, then any conclusions you draw from conversation are at your own peril, except for 2+2=4 and other facts that can be verified independently.