My response to this is extremely negative, since I could hardly disagree with the assumptions of this post more. I’m not even sure there is a point in engaging across such complete disagreement, and my commenting at all may be pointless. Even granting that there are many possible definitions of consciousness, and that people mean somewhat different things by them, the premise of this article is blatantly and obviously wrong: chatbots clearly do not have any consciousness, by any even vaguely plausible definition. There is literally no reason to believe an LLM is conscious, even under terribly weak definitions of consciousness. (It may be possible for something we would recognize as an AI to be conscious, but definitely not an LLM. We know how LLMs work, and they just don’t do these things.)
As to the things you are greater than 90% sure of, I am greater than 99% certain they do not ‘experience’: introspection, purposefulness, experiential coherence, perception of perception, awareness of awareness, sense of cognitive extent, and memory of memory. Symbol grounding is the only one where I am not over 99% sure, because they do have an incredibly tiny amount of grounding, but symbol grounding is not consciousness either, even in full. Grounding is related to knowledge and understanding, but is clearly not consciousness. Also, purposefulness is clearly not even especially related to consciousness (by this definition you would have to think a purely mechanical thermostat is ‘conscious’).
Similarly, I am greater than 99% certain that they experience none of the ones you are 50% sure of: #4 (holistic experience of complex emotions), #5 (experience of distinctive affective states), #6 (pleasure and pain), #12 (alertness), #13 (detection of cognitive uniqueness), and #14 (mind-location). A number of these clearly aren’t related to consciousness either (pleasure/pain, alertness, and probably detection of cognitive uniqueness, though I can’t be sure of that last one because it is too vague to tell what you mean).
Additionally, I am 100% sure they lack the ones you are only 75% sure of. There is logically no possibility that current LLMs have proprioception, awakeness, or a vestibular sense. (Why in the world is that last one even mentioned?) (Awakeness is not even fully required for consciousness, while the other two have nothing to do with it at all.)
Anyone who thinks an LLM has consciousness is just anthropomorphizing anything that has the ability to put together sentences. (Which, to be fair, used to be a heuristic that worked pretty well.)
The primary reason people should care about consciousness is the question ‘are they people?’ (in the same sense that humans, aliens, and certain androids in Star Trek or other sci-fi are people). It is 100% certain that unless panpsychism is true (highly unlikely, and equally applicable to a rock), this kind of device is not a person.
I’m not going to list why you are wrong on every point in the appendix, just some. Nothing in your evidence seems at all convincing.

Introspection: The ability to string together an extra sentence on what a word in a sentence could mean isn’t even evidence of introspection. (At most it would be evidence of the ability to do that about others, not itself.) We know it doesn’t know why it did something.

Purposefulness: Not only irrelevant to consciousness, but also still not evidence. It just looks up in its context window what you told it to do and then comes up with another sentence that fits.

Perception of perception: You are still tricking yourself with anthropomorphization. The answer to the question is always more likely a sentence like ‘no’. The actual trick would be giving them a picture where the answer should be the opposite of the normal ‘no’ answer.

As you continue on, you keep asking leading questions that have obvious answers, and answering them is exactly what it is designed to do. We know how an LLM operates, and what it does is follow leads to complete sentences.

You don’t seem to understand symbol grounding, which is not about getting it to learn new words disconnected from the world, but about how the words relate to the world.
I think many of the things Critch has listed as definitions of consciousness are not “weak versions of some strong version”, they’re just different things.
You bring up a few times that LLMs don’t “experience” [various things Critch lists here]. I agree, they pretty likely don’t (in most cases). But, part of what I interpreted Critch’s point here to be was that there are many things that people mean by “consciousness” that aren’t actually about “experience” or “qualia” or whatnot.
For example, I’d bet (75%) that when Critch says they have introspection, he isn’t making any claims about them “experiencing” anything at all – I think he’s instead saying “in the same way that their information processing system knows facts about Rome and art and biology and computer programming, and can manipulate those facts, it can also know and manipulate facts about its thoughts and internal states.” (whereas other ML algorithms may not be able to know and manipulate their thoughts and internal states)
Purposefulness: Not only irrelevant to consciousness but...
A major point Critch was making in the previous post is that when people say “consciousness”, this is one of the things they sometimes mean. The point is not that LLMs are conscious the way you are using the word, but that when you see debates about whether they are conscious, they will include some people who think it means “purposefulness.”
I agree that people use consciousness to mean different things, but some definitions need to be ignored as clearly incorrect. If someone wants to use a definition of ‘red’ that includes large amounts of ‘green’, we should ignore them. Words mean something, and can’t be stretched to include whatever the speaker wants them to if we are to speak the same language (leaving aside things like how ‘no’ means ‘of’ in Japanese). Things like purposefulness are their own separate thing, with their own terms, and we can meaningfully talk about them if people choose to use the right words. If ‘introspection’ isn’t meant as the internal process, don’t use the term, because it is highly misleading. I do think you are probably right about what Critch means when using the term introspection, but he would still be wrong if he meant that (since they are reflecting on word choice, not on the internal states that led to it).
I don’t feel very hopeful about the conversation atm, but fwiw I feel like you are missing a fairly important point while being pretty overconfident about not having missed it.
Putting it a different way: is there a percentage of people who could disagree with you about what consciousness means that would convince you it’s not as straightforward as assuming you have the correct definition of consciousness and can ignore everyone else? If <50% of people agreed with you? If <50% of the people with most of the power?
(This is not about whether your definition is good, or the most useful, or whatnot – only that, if lots of people turned out to mean different things by it, would it still particularly matter whether your definition was the “right” one?)
(My own answer is that if like >75% of people agreed on what consciousness means, I’d be like “okay yeah Critch’s point isn’t super compelling”. If it was between like 50–75% of people, I’d be like “kinda edge case.” If it’s <50% of people agreeing on consciousness, I don’t think it matters much what definition is “correct.”)
Your comment is not really a response to the comment I made. I am not missing the point at all, and if you think I have, I suspect you missed my point very badly (and are yourself extremely overconfident about it). I have talked explicitly, multiple times, about there being a number of possible definitions of consciousness, and I never favored one of them. I repeat: I never assumed a specific definition of consciousness, since I don’t have a specific one I assume at all, and I am completely open to talking about a number of possibilities. I simply pointed out that some proposed definitions are clearly wrong / useless / better described with other terms. Do not assume what I mean if you don’t understand.
Note that I am not a prescriptivist when it comes to language. The reason the language is wrong isn’t that I have a particular way you should talk about it, but that the term is being used in a way that doesn’t actually fit together with the rest of the language, and thus does not actually convey the intended meaning. If you want to talk about something, talk about it with words that convey that meaning.
On to ‘how many people have to disagree’ for that to matter: one, if they have a real point. But if no one agrees on what a term means, it is meaningless. ‘Consciousness’ is not meaningless, nor is introspection, nor the other words being used. Uses that are clearly wrong are a step toward words becoming meaningless, and that would be a bad thing. Thus, I should oppose it.
Also, my original comment was mostly about direct disagreements with his credences, and implications thereof, not about the definition of consciousness.
Having a vague concept encompassing multiple possible definitions, which you are nonetheless extremely confident is the correct vague concept, is not that different from having a single definition in which you’re confident, and not everyone shares your same vague concept or agrees that it’s clearly the right one.
This statement is obviously incorrect. I have a vague concept of ‘red’, but I can tell you straight out that ‘green’ is not it, and I am utterly correct. Now, where does it go from ‘red’ to ‘orange’? We could have a legitimate disagreement about that. Anyone who uses ‘red’ to mean ‘green’ is just purely wrong.
That said, it wouldn’t even apply to me if your (incorrect) claim about a single definition not being different from an extremely confident vague definition was right. I don’t have ‘extreme confidence’ about consciousness even as a vague concept. I am open to learning new ways of thinking about it and fundamentally changing the possibilities I envision.
I have simply objected to ones that are obviously wrong based on how the language is generally used, because we do need some limit on what counts if we are to discuss anything meaningfully. A lot of the definitions are a bit or a lot off, but I cannot necessarily rule them out, so I didn’t object to them. I have thus allowed a large number of vague concepts that aren’t necessarily even that similar.
“your (incorrect) claim about a single definition not being different from an extremely confident vague definition”
That is not the claim I made. I said it was not very different, which is true. Please read and respond to the words I actually say, not to different ones.
The definitions are not obviously wrong except to people who agree with you about where to draw the boundaries.
And here you are trying to be pedantic about language in ways that directly contradict other things you’ve said in speaking to me. In this case, everything I said holds if we change between ‘not different’ and ‘not that different’ (while you actually misquote yourself as ‘not very different’). That said, I should have included the extra word in quoting you.
Your point is not very convincing. Yes, people disagree if they disagree. I do not draw the lines in specific spots, as you should know based on what I’ve written, but you find it convenient to assume I do.
No, I authentically object to having my qualifiers ignored, which I see as quite distinct from disagreeing about the meaning of a word. Edit: also, I did not misquote myself, I accurately paraphrased myself, using words which I know, from direct first-person observation, mean the same thing to me in this context.
I agree LLMs are probably not conscious, but I don’t think it’s self-evident they’re not; we have almost no reliable evidence one way or the other.

I did not use the term ‘self-evident’ and I do not necessarily believe it is self-evident, because theoretically we can’t prove anything isn’t conscious. My more limited claim is not that it is self-evident that LLMs are not conscious, it’s that they just clearly aren’t conscious. ‘Almost no reliable evidence’ in favor of consciousness is coupled with the fact that we know how LLMs work (and the details we do not know are probably not important to this matter), and how they work is no more related to consciousness than an ordinary computer program is. It would require an incredible amount of evidence to make it reasonable to consider that they might be conscious, given what we know. If panpsychism is true, then they might be conscious (as would a rock!), but panpsychism is incredibly unlikely.
My dialect does not have the fine distinction between “clear” and “self-evident” on which you seem to be relying; please read “clear” for “self-evident” in order to access my meaning.
Pedantically, ‘self-evident’ and ‘clear’ are different words/phrases, and you should not have emphasized ‘self-evident’ in a way that makes it seem like I used it, regardless of whether you personally care about or make that distinction. I then explained why a lack of evidence should be read against the idea that a modern AI is conscious (basically, the prior probability is quite low).
My emphasis implied you used a term which meant the same thing as self-evident, which in the language I speak, you did. Personally I think the way I use words is the right one and everyone should be more like me; however, I’m willing to settle on the compromise position that we’ll both use words in our own ways. As for the prior probability, I don’t think we have enough information to form a confident prior here.
Do you hold panpsychism as a likely candidate? If not, then you most likely believe the vast majority of things are not conscious. We have a lot of evidence that the way an LLM operates does not differ from other objects in any way we don’t understand. Thus, almost the entire reference class would be things that are not conscious. If you do believe in panpsychism, then obviously AIs would be conscious too, but it wouldn’t be an especially meaningful statement.
You could choose computer programs as the reference class, but most people are quite sure those aren’t conscious in the vast majority of cases. So what, in the mechanisms underlying an LLM, is meaningfully different in a way that might cause consciousness? There don’t seem to be any likely candidates at a technical level. Thus, we should not increase our prior above that of other computer programs. This does not rule out consciousness, but it does make it rather unlikely.
I can see you don’t appreciate my pedantic points regarding language, but be more careful if you want to say that you are substituting a word for what I used. It is bad communication if it was meant as a translation. It would easily mislead people into thinking I claimed it was ‘self-evident’. I don’t think we can meaningfully agree to use words in our own way if we are actually trying to communicate since that would be self-refuting (as we don’t know what we are agreeing to if the words don’t have a normal meaning).
You in particular clearly find it to be poor communication, but I think the distinction you are making is idiosyncratic to you. I also have strong and idiosyncratic preferences about how to use language, which from the outside view are equally likely to be correct; the best way to resolve this is of course for everyone to recognize that I’m objectively right and adjust their speech accordingly, but I think the practical solution is to privilege neither above the other.
I do think that LLMs are very unlikely to be conscious, but I don’t think we can definitively rule it out.
I am not a panpsychist, but I am a physicalist, and so I hold that thought can arise from inert matter. Animal thought does, and I think other kinds could too. (It could be impossible, of course, but I’m currently aware of no reason to be sure of that). In the absence of a thorough understanding of the physical mechanisms of consciousness, I think there are few mechanisms we can definitively rule out.
Whatever the mechanism turns out to be, however, I believe it will be a mechanism which can be implemented entirely via matter; our minds are built of thoughtless carbon atoms, and so too could other minds be built of thoughtless silicon. (Well, probably; I don’t actually rule out that the chemical composition matters. But like, I’m pretty sure some other non-living substances could theoretically combine into minds.)
You keep saying we understand the mechanisms underlying LLMs, but we just don’t; they’re shaped by gradient descent into processes that create predictions in a fashion almost entirely opaque to us. AIUI there are multiple theories of consciousness under which it could be a process instantiable that way (and, of course, the true theory could be one we haven’t thought of yet). If consciousness is a function of, say, self-modeling (I don’t think this one’s true, just using it as an example), it could plausibly be instantiated simply by training the model in contexts where it must self-model to predict well. If illusionism (which I also disbelieve) is true, perhaps the models already feel the illusion of consciousness whenever they access information internal to them. Et cetera.
As I’ve listed two theories I disbelieve and none I agree with, which strikes me as perhaps discourteous, here are some theories I find not-entirely-implausible. Please note that I’ve given them about five minutes of casual consideration each and could easily have missed a glaring issue.
Attention schema theory, which I heard about just today
‘It could be about having an efference copy’
I heard about a guy who thought it came about from emotions, and therefore was localized in (IIRC) the amygdala (as opposed to the cortex, where it sounded like he thought most people were looking)
Ipsundrums (though I don’t think I buy the bit about it being only mammals and birds in the linked post)
Global workspace theory
[something to do with electrical flows in the brain]
Anything with biological nerves is conscious, if not of very much (not sure what this would imply about other substrates)
Uhh it doesn’t seem impossible that slime molds could be conscious, whatever we have in common with slime molds
Who knows? Maybe every individual cell can experience things. But, like, almost definitely not.
You might believe that the distinctions I make are idiosyncratic, though the meanings are in fact clearly distinct in ordinary usage, but I clearly do not agree with your misleading use of what people would be led to think are my words, and you should take care not to conflate things. You want people to precisely match your own qualifiers in cases where that causes no difference in the meaning of what is said (which makes enough sense), but you will directly object to people pointing out a clear miscommunication of yours because you do not care about the difference in meaning. And you are continually asking me to give in on language, regardless of how correct I may be, while claiming it is better to privilege neither. That is not a useful approach.
(I take no particular position on physicalism at all.) Since you are not a panpsychist, you likely believe that consciousness is not common to the vast majority of things. That means the basic prior for whether an item is conscious is ‘almost certainly not’, unless we have already updated it based on other information. Under what reference class or mechanism should we be more concerned about the consciousness of an LLM than of an ordinary computer running ordinary programs? There is nothing in its operating principles that seems particularly likely to lead to consciousness.
There are many people, including the original poster of course, trying to use behavioral evidence to get around that, so I pointed out how weak that evidence is.
An important distinction you seem not to see in my writing (whether because I wrote unclearly or you missed it doesn’t really matter) is that when I speak of knowing the mechanisms by which an LLM works, I mean something very fundamental. We know these two things: 1) exactly what mechanisms are used to do the operations involved in executing the program (physically on the computer and mathematically), and 2) the exact mechanisms through which we determine which operations to perform.
As you seem to know, LLMs are actually extremely simple programs operating on extremely large matrices, with values chosen by the very basic procedure of gradient descent. Nothing about gradient descent is especially interesting from a consciousness point of view. It’s basically a massive chained use of very simplified ODE solvers, which are extremely well understood and clearly have no consciousness at all if anything mathematical doesn’t. It could also be viewed as just a very large number of variables in a massive but simple statistical regression. Notably, even if gradient descent were related to consciousness directly, we would still have no reason to believe that an LLM doing inference rather than training would be conscious. Simple matrix math doesn’t seem like much of a candidate for consciousness either.
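To be concrete about the level I mean, here is a toy sketch (my own illustration, nothing remotely like a real LLM in scale; the shapes and learning rate are made up): training is just repeated small arithmetic adjustments to a weight matrix, and inference afterwards is just multiplying an input through the frozen weights.

```python
import numpy as np

# Toy "model": a single weight matrix fitted by plain gradient descent.
# Real LLMs are vastly larger and have more layers, but the low-level
# mechanism is the same kind of arithmetic.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 8))        # inputs
Y = X @ rng.normal(size=(8, 4))      # targets from some unknown linear map
W = np.zeros((8, 4))                 # the weights to be learned

learning_rate = 0.01
for step in range(500):
    predictions = X @ W              # forward pass: just matrix multiplication
    error = predictions - Y
    gradient = X.T @ error / len(X)  # gradient of the squared error (up to a constant factor)
    W -= learning_rate * gradient    # the entire "learning" mechanism

# "Inference" with frozen weights: a single matrix multiplication.
new_input = rng.normal(size=(1, 8))
print(new_input @ W)
```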
Someone trying to make the case for consciousness would thus need to think it likely that one of the other mechanisms in LLMs is related to consciousness, but LLMs are actually missing a great many mechanisms that would enable things like self-reflection and awareness (including a number that were present in primitive earlier neural networks, such as recursion and internal loops). The people trying to make up for those omissions do a number of things to attempt to recreate them (‘attention’ being the built-in one, but also things like feeding previous outputs back in), but those very simple approaches don’t seem like likely candidates for consciousness (to me).
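To illustrate what I mean by the use of previous outputs being bolted on from outside (a simplified sketch of my own; `next_token` here is just a stand-in for one forward pass of a trained network, not real model code): the generation loop lives outside the model, which runs straight through once per token with no internal recursion or persistent state.

```python
def next_token(context):
    # Stand-in for one forward pass of a trained network: a pure function
    # from the visible context to a next token, with no memory of its own.
    return str(len(context) % 10)

def generate(prompt, n_tokens):
    context = list(prompt)
    for _ in range(n_tokens):
        token = next_token(context)   # the model only ever sees the text so far
        context.append(token)         # "memory" is just re-feeding prior output
    return "".join(context)

print(generate("hello ", 5))
```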
Thus, it remains extremely unlikely that an LLM is conscious.
When you say we don’t know what mechanisms are used, you seem to be talking about not understanding a completely different thing than what I am saying we understand. We don’t understand exactly what each weight means (except in some rare cases that some researchers have seemingly figured out), or why it was chosen rather than any number of other values that would work out similarly, but that is most likely unimportant to my point about mechanisms. This is, as far as I can tell, a genuine ambiguity in the meaning of ‘mechanism’: we can be talking about completely different levels at which mechanisms operate, and I am talking about the very lowest ones.
Note that I do not usually make a claim about the mechanisms underlying consciousness in general, except that it is unlikely to be these extremely basic physical and mathematical ones. I genuinely do not believe that we know enough about consciousness to nail it down to even a small subset of theories. That said, there are still a large number of theories of consciousness that either don’t make internal sense, or seem at most like components of it.
Pedantically, if consciousness is related to ‘self-modeling’, the implication is that the modeling needs to be internal, for the basic reason that it is just ‘modeling’ otherwise. I can’t prove that external modeling isn’t enough for consciousness (how could I?), but I am unaware of anyone making that contention.
So, would your example be ‘self-modeling’? Your brief sentence isn’t enough for me to be sure what you mean. But if it is related to people’s recent claims about introspection on this board, then I don’t think so. That would be modeling the external actions of an item that happened to turn out to be itself. For example, if I were to read the life story of a person I didn’t realize was me, and make inferences about how the subject would act under various conditions, that isn’t really self-modeling. On the other hand, in the comments to that post, I actually proposed that you could train a model on its own internal states, and that could maybe have something to do with this (if the self-modeling theory is true). This is something we do not train current LLMs on at all, though.
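To spell out the hypothetical (purely my own illustrative sketch; nothing like this is done when training current LLMs): alongside the ordinary objective, an auxiliary head would be trained to predict the network’s own hidden activations, so part of the training signal is about an internal state rather than external behaviour.

```python
import numpy as np

# Hypothetical sketch only: an auxiliary head is trained to predict the
# network's own hidden activations, alongside an ordinary prediction task.
rng = np.random.default_rng(1)
X = rng.normal(size=(64, 6))                   # inputs
Y = rng.normal(size=(64, 3))                   # ordinary external targets

W_hidden = rng.normal(scale=0.1, size=(6, 5))  # produces the internal state
W_out = rng.normal(scale=0.1, size=(5, 3))     # ordinary task head
W_self = rng.normal(scale=0.1, size=(6, 5))    # head predicting the hidden state

lr = 0.01
for step in range(200):
    hidden = np.tanh(X @ W_hidden)             # the "internal state"
    task_error = hidden @ W_out - Y            # error on the external task
    self_error = X @ W_self - hidden           # error in predicting its own state

    # Gradient steps (the hidden weights are updated only via the task loss
    # here, to keep the sketch short).
    W_out -= lr * hidden.T @ task_error / len(X)
    W_self -= lr * X.T @ self_error / len(X)
    grad_hidden = (task_error @ W_out.T) * (1 - hidden ** 2)
    W_hidden -= lr * X.T @ grad_hidden / len(X)
```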
As far as I can tell (as someone who finds the very idea of illusionism strange), illusionism is itself not a useful point of view in this dispute, because it would make the question of whether an LLM is conscious pretty moot. Depending on the mood of the speaker, the answer would be something like ‘why should I care?’, or ‘no’, or even ‘to the same extent as people’, regardless of how an LLM (or an ordinary computer program, almost all of which process information heavily) works. If consciousness is an illusion, we aren’t talking about anything real, and it is thus useful to ignore illusionism when talking about this question.
As I mentioned before, I do not have a particularly strong theory of what consciousness actually is, nor even, necessarily, a vague set of explanations that I believe in more or less strongly.
I can’t say I’ve heard of ‘attention schema theory’ before, nor some of the other things you mention, like ‘efference copy’ (though the latter seems to be all about the body, which doesn’t seem an especially promising theory of what consciousness may be; I also can’t rule out it being part of it, since the idea is that it is used in self-modeling, which, as I mentioned earlier, I can’t actually rule out either).
My pet theory of emotions is that they are simply a shorthand for ‘you should react in ways appropriate to a situation that is...’ a certain way. For example (and these were not carefully chosen examples), anger would be ‘a fight’, happiness would be ‘very good’, sadness would be ‘very poor’, and so on. More complicated emotions might include things like the situation being good but also high stakes. The reason for using a shorthand would be that our conscious mind is very limited in what it can fit at once. Despite this being uncertain, I find it much more likely than emotions themselves being consciousness.
I would explain things like blindsight (from your ipsundrum link) through a subconscious mind that gathers information and makes a shorthand before passing it to the rest of the mind (much like my theory of emotions). The shorthand without the actual sensory input could definitely lead to not seeing, yet still being able to use the input to an extent. Like you, I see no reason why this should be limited to the one pathway they found in certain creatures (in this case mammals and birds). I certainly can’t rule out that this is related directly to consciousness, but I think it more likely to be another input to consciousness rather than consciousness itself.
Side note, I would avoid conflating consciousness and sentience (like the ipsundrum link seems to). Sensory inputs do not seem overly necessary to consciousness, since I can experience things consciously that do not seem related to the senses. I am thus skeptical of the idea that consciousness is built on them. (If I were really expounding my beliefs, I would probably go on a diatribe about the term ‘sentience’ but I’ll spare you that. As much as I dislike sentience based consciousness theories, I would admit them as being theories of consciousness in many cases.)
Again, I can’t rule out global workspace theory, but I am not sure how it is especially useful. What makes a global workspace conscious that doesn’t happen in an ordinary computer program I could theoretically write myself? A normal program might take a large number of inputs, process them separately, and then put it all together in a global workspace. It thus seems more like a theory of ‘where it occurs’ than of ‘what it is’.
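For instance, the kind of ordinary program I have in mind (a deliberately mundane sketch of my own, with made-up processing steps): several independent processors each handle their own input, and their results are pooled into one shared structure that any later step can read.

```python
# A deliberately mundane "global workspace": independent processors write
# their results into one shared structure that later steps can read from.
def process_vision(image_data):
    return {"brightness": sum(image_data) / len(image_data)}

def process_sound(audio_data):
    return {"loudness": max(audio_data)}

def process_text(text_data):
    return {"word_count": len(text_data.split())}

workspace = {}                      # the shared "global workspace"
workspace.update(process_vision([0.2, 0.8, 0.5]))
workspace.update(process_sound([0.1, 0.9, 0.4]))
workspace.update(process_text("a perfectly ordinary program"))

# Any downstream step can now consult the pooled results.
print(workspace)
```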
‘Something to do with electrical flows in the brain’ is obviously not very well specified, but it could possibly be meaningful if you mean the way a pattern of electrical flows causes future patterns of electrical flows as distinct from the physical structures the flows travel through.
Biological nerves being the basis of consciousness directly is obviously difficult to evaluate. It seems too simple, and I am not sure whether it is possible to have such tiny amounts of consciousness that then add up to our level of it. (I am also unsure whether there is a spectrum of consciousness beyond the levels known within humans.)
I can’t say I would believe a slime mold is conscious (but again, I can’t prove it is impossible). I would probably not believe any simple animals (like ants) are either, even if someone had a good explanation for why their theory says the ant would be. Ants and slime molds still seem more likely to be conscious to me than current LLM-style AI, though.