Stuart_Armstrong comments on The flawed Turing test: language, understanding, and partial p-zombies

Stuart_Armstrong 17 May 2013 17:48 UTC
0 points

First, you mean “pass”, not “past”

Oops! Now corrected.

Second, what definition of “conscious” are you using here?

I’m not using one. Part of the problem is that the Turing test is measuring something, but it’s not entirely clear what.
- Lumifer 17 May 2013 20:14 UTC
  3 points
  Parent
  Surely it is clear what the Turing test is measuring. It is measuring the ability to pass for a human under certain conditions.
  
  A better question is whether (and in what way) does the ability to pass for a human correlate with other qualities of interest, notably ones which we vaguely describe as “intelligent” or “conscious”.
  - bogdanb 17 May 2013 23:22 UTC
    5 points
    Parent
    
    does the ability to pass for a human correlate with [qualities] which we vaguely describe as “intelligent” or “conscious”[?]
    
    I always thought (and was quite convinced of it, though I can’t seem to think of a reason why now) that the Turing test was explicitly designed as a “sufficient” rather than a “necessary” kind of test. As in, you don’t need to pass it to be “human-level”, but if you do then you certainly are. (Or, more precisely, as long as we’ve established we can’t tell, then who cares? With a similar sentiment for exactly what it is we’re comparing when we say “human-level”: it’s something about how much smarter we are than monkeys, we’re not sure quite what it is, but we can’t tell the difference, so you’re in.) A brute-force, first-try, upper-bound sort of test.
    
    But I get the feeling from some of the comments that it claims more than that (or maybe doesn’t disclaim as much). Am I missing some literature or something?
    - Bugmaster 18 May 2013 0:10 UTC
      5 points
      Parent
      I personally agree with your comment (assuming I understand it correctly). As far as I can tell, however, some people believe that merely being able to converse with humans on their own level is not sufficient to establish the agent’s ability to think on the human level. I personally think this belief is misguided, since it privileges implementation details over function, but I could always be wrong.
    - TheOtherDave 18 May 2013 1:43 UTC
      1 point
      Parent
      IIRC, Turing introduces the concept in the paper as a sufficient but not necessary condition, as you describe here.
    - Stuart_Armstrong 18 May 2013 18:24 UTC
      0 points
      Parent
      I feel it may be neither necessary nor sufficient. It would be a pretty strong indication, but wouldn’t be enough on its own.
  - Stuart_Armstrong 17 May 2013 20:20 UTC
    2 points
    Parent
    
    A better question is whether (and in what way) does the ability to pass for a human correlate with other qualities of interest, notably ones which we vaguely describe as “intelligent” or “conscious”.
    
    Yes, that’s the issue.
    - Bugmaster 18 May 2013 0:31 UTC
      0 points
      Parent
      Is there any way we can test for consciousness without using some version of the Turing Test? If the answer is “no”, then I don’t see the point of caring about it.
      
      As for “intelligence”, it’s a little trickier. There could be agents out there who are generally intelligent yet utterly inhuman. The Turing Test would not, admittedly, apply to them.
      - Stuart_Armstrong 18 May 2013 18:25 UTC
        0 points
        Parent
        We could use those extended versions of the Turing tests I mentioned—anything that the computer hasn’t been specifically optimised on would work.
        Bugmaster 18 May 2013 20:55 UTC
        0 points
        Parent
        I am not sure what you mean by “optimized on”. What if we made an AI that was really good at both chatting and playing music? It could pass your extended test then (while many humans, such as myself, would fail). Now what?
        Stuart_Armstrong 20 May 2013 9:08 UTC
        0 points
        Parent
        Then I’d test it on 3D movements. The point is that these tests have great validity as tests for general intelligence (or something in the vicinity), provided the programmer isn’t deliberately optimising or calibrating their machine on them.
        
        If you’d designed a chatterbot and it turned out to be great at playing music (and that wasn’t something you’d put in by hand), then that would be strong evidence for general intelligence.
        TheOtherDave 20 May 2013 13:55 UTC
        2 points
        Parent
        The deliberate optimization on the part of a designer is just an example of the sort of thing you are concerned about here, right? That is, if I used genetic algorithms to develop a system X, and exposed those algorithms to a set of environments E, X would be optimized for E and consequently any test centered on E (or any subset of it) would be equally unreliable as a test of general intelligence… the important thing is that because X was selected (intentionally or otherwise) to be successful at E, the fact that X is successful at E ought not be treated as evidence that X is generally intelligent.
        
        Yes?
        
        Similarly, the fact that X is successful at tasks not actually present in E, but nevertheless very similar to tasks present in E, ought not be treated as evidence that X is generally intelligent. A small amount of generalization from initial inputs is not that impressive.
        
        The question then becomes how much generalization away from the specific problems presented in E is necessary before we consider X generally intelligent.
        
        To approach the question differently—there are all kinds of cognitive tests which humans fail, because our cognitive systems just weren’t designed to handle the situations those tests measure, because our ancestral environment didn’t contain sufficiently analogous situations. At what point do we therefore conclude that humans aren’t really generally intelligent, just optimized for particular kinds of tests?