IIRC Redwood Research investigated human performance on next-token prediction, and humans were mostly worse than even small (by current standards) language models?
sounds right, where “worse” here means “higher bits per word when predicting an existing sentence”, a very unnatural metric that humans don’t spend significant effort on.
That is actually a natural metric for the brain, and close to what the linguistic cortex does internally. The comparison is still off, though: having a human play a word-prediction game and comparing their scores against the LLM’s native internal logit predictions is kind of silly. The real comparison is either a human playing that game versus an LLM playing the exact same game in the exact same way (i.e. asking GPT verbally to predict the logit score of the next word/token), or comparing internal low-level transformer logit scores against linear readout models from brain neural probes/scans.
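For concreteness, here is a minimal sketch of what “bits per word” means on the LLM side, assuming the Hugging Face transformers library and GPT-2 as a stand-in small model (both are my assumptions, not the exact Redwood setup). The human version of the game would be scored with the same formula, just using the probabilities the human assigns.

```python
# Minimal sketch: compute an LLM's bits per word on a sentence.
# Assumes Hugging Face transformers with GPT-2; model choice and the
# crude word count are illustrative, not the actual Redwood methodology.
import math

import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

sentence = "The quick brown fox jumps over the lazy dog."
ids = tokenizer(sentence, return_tensors="pt").input_ids

with torch.no_grad():
    # Mean cross-entropy over next-token predictions, in nats per token.
    loss = model(ids, labels=ids).loss

n_predicted_tokens = ids.shape[1] - 1   # tokens the model actually predicts
n_words = len(sentence.split())          # crude word count

total_nats = loss.item() * n_predicted_tokens
bits_per_word = total_nats / math.log(2) / n_words
print(f"GPT-2: {bits_per_word:.2f} bits per word on this sentence")
```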
oh interesting point, yeah.