sounds right, where “worse” here means “higher bits per word at predicting an existing sentence”, a very unnatural metric humans don’t spend significant effort on.
That is actually a natural metric for the brain and close to what the linguistic cortex does internally. Having a human play a word-prediction game and comparing their scores against the native internal logit predictions of an LLM is kinda silly, though. The real comparison should be between a human playing that game and an LLM playing the exact same game in the exact same way (i.e., asking GPT verbally to predict the probability of the next word/token), or you should compare internal low-level transformer logits to linear readout models from brain neural probes/scans.
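For concreteness, here is a minimal sketch of the “bits per word” metric under discussion: the model’s average negative log2-probability over the tokens of an existing sentence, rescaled to words. The choice of GPT-2 via Hugging Face transformers and the example sentence are just assumptions for illustration, not anything from the discussion above.

```python
# Minimal sketch: bits-per-word of an existing sentence under a language model.
# Assumes GPT-2 via Hugging Face transformers (an arbitrary choice for illustration).
import math
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

sentence = "The cat sat on the mat."
enc = tokenizer(sentence, return_tensors="pt")

with torch.no_grad():
    out = model(**enc, labels=enc["input_ids"])

# out.loss is the mean cross-entropy per predicted token, in nats.
# Convert to bits, then rescale from per-token to per-word.
n_tokens = enc["input_ids"].shape[1]
n_words = len(sentence.split())
bits_per_token = out.loss.item() / math.log(2)
# The first token is never predicted, so there are (n_tokens - 1) predictions.
bits_per_word = bits_per_token * (n_tokens - 1) / n_words

print(f"{bits_per_token:.2f} bits/token, ~{bits_per_word:.2f} bits/word")
```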
oh interesting point, yeah.