Andrew Burns comments on Andrew Burns’s Shortform

Andrew Burns 6 Jun 2024 18:27 UTC
3 points
−1
A Chinese company released a new SORA competitor—Kling—and it is arguably superior to SORA publically available. Could be exfiltration or could be genuinely home grown. In any case, the moat is all gone.
- cubefox 6 Jun 2024 18:38 UTC
  1 point
  0
  Parent
  Link: https://kling.kuaishou.com/
  - Andrew Burns 6 Jun 2024 18:48 UTC
    1 point
    −1
    Parent
    So US has already slipped behind despite chip limits. I also saw that Llama 3 was already bested by Qwen 2. We are about a week away from some Chinese model surpassing GPT-4o on Lmsys. I want to hear the China-is-no-big-deal folks explain this.
    - Akram Choudhary 6 Jun 2024 19:21 UTC
      2 points
      0
      Parent
      Wait till you find out that qwen 2 is probably just llama 3 with a few changes and some training on benchmarks to inflate performance a bit
      - Andrew Burns 6 Jun 2024 19:29 UTC
        1 point
        0
        Parent
        Possible. Possible. But I don’t see how that is more likely than that Alibaba just made something better. Or they made something with with lots of contamination. I think this should make us update toward not underestimating them. The Kling thing is a whole nother issue. If it is confirmed text-to-video and not something else, then we are in big trouble because the chip limits have failed.
        cubefox 6 Jun 2024 20:50 UTC
        4 points
        −3
        Parent
        For what it’s worth, Yann LeCun argues that video diffusion models like Sora, or any models which predict pixels, are useless for creating an AGI world model. So this might be a dead end. The reason, according to LeCun, is that pixel data is very high dimensional and redundant compared to text (LLMs only use something like 65.000 tokens), which makes exact prediction less useful. In his 2022 outline of his proposed AGI framework, JEPA, he instead proposes an architecture which predicts embeddings rather than exact pixels.