dr_s comments on Memetic Judo #3: The Intelligence of Stochastic Parrots v.2

dr_s 17 Aug 2023 10:27 UTC
2 points
0
No, I think it’s absolutely possible, at least theoretically—not sure what would it take to actually do it of course. But that’s my point, there exists somewhere in the space of possible LLMs a “always gives you the wisest, most truthful response” model that does exactly the same thing, predicting the next token. As long as the prediction is always that of the next token that would appear in the wisest, most truthful response!
- TAG 17 Aug 2023 11:09 UTC
  2 points
  0
  Parent
  Which is different to predicting a token on the basis of the statistical regularities in the training data. An LLM that works that way is relatively poor at reliably outputting truth, so a version of the SP argument goes through.
  - dr_s 17 Aug 2023 11:11 UTC
    2 points
    0
    Parent
    I think for the limit of infinite, truthful training data, with sufficient abstraction, it would not be necessarily different. We too form our beliefs from “training data” after all, we’re just highly multimodal and smart enough to know the distinction between a science textbook and a fantasy novel. An LLM doesn’t have maybe that distinction perfectly clear—though it does grasp it to some point.
    - TAG 17 Aug 2023 12:04 UTC
      2 points
      0
      Parent
      
      We too form our beliefs from “training data”
      
      There’s no evidence that we do so based solely on token prediction, so that’s irrelevant.
      - dr_s 17 Aug 2023 13:46 UTC
        2 points
        0
        Parent
        I just don’t really understand in what way “token prediction” is anything less than “literally any possible function from a domain of all possible observations to a domain of all possible actions”. At least if your “tokens” cover extensively enough all the space of possible things you might want to do or say.
    - Max TK 17 Aug 2023 13:45 UTC
      1 point
      0
      Parent
      I think a significant part of the problem is not the LLMs trouble of distinguishing truth from fiction, it’s rather to convince it through your prompt that the output you want is the former and not the latter.