Cool experiment! I could imagine that the tokenizer handicaps GPT’s performance here (reversing the characters leads to completely different tokens). With a character-level tokenizer GPT should/might be able to handle that task better!
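To make the tokenizer point concrete, here is a minimal sketch (assuming the tiktoken package, whose "gpt2" encoding approximates the BPE these models use; the example word is arbitrary) comparing how a word, its reversal, and a space-separated spelling get tokenized:

```python
import tiktoken  # assumption: tiktoken is installed; its "gpt2" encoding approximates the BPE in question

enc = tiktoken.get_encoding("gpt2")

# Compare how a word, its reversal, and its space-separated form are tokenized.
for text in ["lemonade", "edanomel", "l e m o n a d e"]:
    ids = enc.encode(text)
    pieces = [enc.decode([i]) for i in ids]
    print(f"{text!r:>20} -> {len(ids)} tokens: {pieces}")

# In practice the reversed word splits into different (and usually more) pieces than
# the original, and the space-separated form splits into roughly one token per letter,
# which is why these representations behave so differently for the model.
```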
I was slightly surprised to find that even fine-tuning GPT-Neo-125M for a long time on many sequences of letters followed by spaces, then a colon, then the same sequence in reverse, was not enough to get it to pick up the pattern. Probably the positional encoding vectors make the difference between, e.g., “18 tokens away” and “19 tokens away” rather subtle. However, I then tried fine-tuning on a similar dataset with numbers interleaved (e.g. “1 W 2 O 3 R 4 D 5 S : 5 S 4 D 3 R 2 O 1 W”, or something roughly like that; I can’t remember the exact representation), and it picked up the pattern right away. Data representation matters a lot!
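For concreteness, here is a rough sketch of how that kind of fine-tuning data could be generated. The exact format above is from memory, so the helper names and both formats below are only my approximation of it:

```python
import random
import string

def plain_example(word: str) -> str:
    # "W O R D S : S D R O W" -- spaced letters, a colon, then the reversed letters
    letters = list(word.upper())
    return " ".join(letters) + " : " + " ".join(reversed(letters))

def numbered_example(word: str) -> str:
    # "1 W 2 O 3 R 4 D 5 S : 5 S 4 D 3 R 2 O 1 W" -- each letter tagged with its position,
    # so the model can copy by matching indices instead of counting relative positions
    letters = list(word.upper())
    forward = " ".join(f"{i + 1} {c}" for i, c in enumerate(letters))
    backward = " ".join(f"{i + 1} {c}" for i, c in reversed(list(enumerate(letters))))
    return forward + " : " + backward

word = "".join(random.choices(string.ascii_lowercase, k=5))
print(plain_example(word))
print(numbered_example(word))
```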
Here’s an example of someone prompting with a walkthrough of a similar token-aware approach to successfully guide GPT-3:
https://twitter.com/npew/status/1525900849888866307
I tried to use that approach to teach GPT-3 to solve the problem at the top of this post. As you can see, it kinda worked: GPT-3 grasps that some things need to be reversed, but it then goes a bit off the rails. It adds a random “this is a great” to the end of my prompt, with the whole phrase reversed rather than each word; then it starts out reversing the individual words of the sentence, but ends up just completing the sentence instead, using the other common completion (“falls” rather than “stays”). Finally, when it tries to reverse each individual word, it fails completely and just reorders/reworks the words a bit.
Fascinating. Thanks!
This approach is a little surprising. I would have thought that adding numbers to my space-separating approach, and then merging the space-separated letters back into a final solid word, would have tripped up GPT-3 and inevitably led to errors. But, at least with InstructGPT, it works.
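If it helps, the transformation being asked of GPT-3 here is roughly equivalent to the following (a hypothetical reconstruction in code, not the actual prompt; the example sentence is arbitrary):

```python
def reverse_word_via_numbering(word: str) -> str:
    # Step 1: pair each letter with its 1-based position
    # (the prompt writes this out as e.g. "1 s 2 t 3 a 4 y 5 s" for "stays")
    numbered = [(i + 1, c) for i, c in enumerate(word)]
    # Step 2: write the numbered letters out in reverse order ("5 s 4 y 3 a 2 t 1 s")
    reversed_numbered = list(reversed(numbered))
    # Step 3: merge the space-separated letters back into a solid word ("syats")
    return "".join(c for _, c in reversed_numbered)

# Arbitrary example sentence (not the actual prompt from the post):
print(" ".join(reverse_word_via_numbering(w) for w in "hello world".split()))
# -> "olleh dlrow"
```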
Thanks; very interesting result.
Fascinating! Thanks for sharing!
For the similar anagram task, I found that space-separating (to avoid the BPE inconsistency/nondeterminism by forcing it to encode individual letters) seemed to help: https://gwern.net/GPT-3-nonfiction#anagrams
For this task, I think a worthwhile followup would be to experiment with the new edit mode.
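A rough sketch of what that experiment might look like with the edits endpoint, via the older (pre-1.0) openai Python client; the instruction wording and input are just illustrative:

```python
import openai  # assumes the older (pre-1.0) openai client and OPENAI_API_KEY set in the environment

# Illustrative only: ask the edit model to reverse the letters of each word in place.
response = openai.Edit.create(
    model="text-davinci-edit-001",  # the instruction-following edit model available at the time
    input="olleh dlrow",
    instruction="Reverse the letters of each word.",
)
print(response["choices"][0]["text"])
```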
Possibly! Though it did seem to recognise that the words were spelt backwards. It must have some backwards-spelt words in its training data, just not that many.