I don’t really know what to make of this objection, because I have never seen the stochastic parrot argument applied to a specific, limited architecture as opposed to the general category.
Edit: Maybe make a suggestion for how to rephrase this to improve my argument.
> I have never seen the stochastic parrot argument applied to a specific, limited architecture
I’ve never seen anything else. According to Wikipedia, the term was originally applied to LLMs.
> The term was first used in the paper “On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?” by Bender, Timnit Gebru, Angelina McMillan-Major, and Margaret Mitchell (using the pseudonym “Shmargaret Shmitchell”).[4]
LLMs are neural networks, and neural networks are proven to be able to approximate any function to an arbitrarily close degree, hence LLMs are able to approximate any function to an arbitrarily close degree (given enough layers, of course).
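For reference, the relevant result here is the classical, arbitrary-width universal approximation theorem (Cybenko 1989 and Hornik 1991 for sigmoidal activations; Leshno et al. 1993 for any continuous, non-polynomial activation): for every continuous $f : K \to \mathbb{R}$ on a compact set $K \subset \mathbb{R}^n$ and every $\varepsilon > 0$, there exist a finite width $N$, weights $w_i \in \mathbb{R}^n$, and scalars $a_i, b_i$ such that

$$\sup_{x \in K}\Bigl|\,f(x) - \sum_{i=1}^{N} a_i\,\sigma\!\left(w_i^{\top}x + b_i\right)\Bigr| < \varepsilon.$$

This is an existence statement about sufficiently wide single-hidden-layer networks approximating continuous functions on compact sets.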
> I don’t really know what to make of this objection, because I have never seen the stochastic parrot argument applied to a specific, limited architecture as opposed to the general category.
> Edit: Maybe make a suggestion for how to rephrase this to improve my argument.
Citation. Quote something somebody said.
> I’ve never seen anything else. According to Wikipedia, the term was originally applied to LLMs.
LLMs use one or more inner (hidden) layers, so shouldn’t the proof apply to them?
What proof?
Of the universal approximation theorem
How are inner layers relevant?
LLMs are neural networks, and neural networks are proven to be able to approximate any function to an arbitrarily close degree, hence LLMs are able to approximate any function to an arbitrarily close degree (given enough layers, of course).
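For concreteness, here is a minimal toy sketch of the arbitrary-width version of that claim (illustrative only; nothing LLM-specific is assumed): a single hidden layer of tanh units, with randomly drawn hidden weights and output weights fitted by least squares, approximating sin(x) on [-π, π]. The maximum error generally shrinks as the hidden width grows.

```python
# Toy illustration of one-hidden-layer approximation (illustrative sketch only).
import numpy as np

rng = np.random.default_rng(0)

def fit_one_hidden_layer(x, y, width):
    """Fit y ~= tanh(x*w + b) @ a: random hidden weights, least-squares output weights."""
    w = rng.normal(scale=3.0, size=width)        # hidden-layer input weights (1-D input)
    b = rng.uniform(-np.pi, np.pi, size=width)   # hidden-layer biases
    h = np.tanh(np.outer(x, w) + b)              # hidden activations, shape (len(x), width)
    a, *_ = np.linalg.lstsq(h, y, rcond=None)    # output-layer weights
    return lambda t: np.tanh(np.outer(t, w) + b) @ a

x = np.linspace(-np.pi, np.pi, 500)
y = np.sin(x)

for width in (2, 10, 100):
    model = fit_one_hidden_layer(x, y, width)
    print(f"hidden units = {width:3d}   max |error| = {np.max(np.abs(model(x) - y)):.4f}")
```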