QwQ-32B-Preview was released open-weights and seems comparable to o1-preview. Unless they’re gaming the benchmarks, I find it both pretty impressive and quite shocking that a 32B model can achieve this level of performance. Seems like great news vs. opaque (e.g., in-one-forward-pass) reasoning. Less good with respect to proliferation (there don’t seem to be any [deep] algorithmic secrets), misuse, and short timelines.
From a proliferation perspective, it reduces overhang and makes it more likely that Llama 4 gets long-reasoning-trace post-training in-house rather than later, so that initial capability evaluations give more relevant results. But if Llama 4 is already training, there might not be enough time for the technique to mature, and the Llamas have been quite conservative in their techniques so far.
There have been comments from OAI staff that o1 is “GPT-2 level”, so I wonder if it’s a similar size?
I think they meant that as an analogy to how developed/sophisticated it was (i.e., they’re saying that it’s still early days for reasoning models and to expect rapid improvement), not that the underlying model size is similar.
IIRC OAers also said somewhere (doesn’t seem to be in the blog post, so maybe this was on Twitter?) that o1 or o1-preview was initialized from a GPT-4 (a GPT-4o?), so that would also rule out a literal parameter-size interpretation (unless OA has really brewed up some small models).
There was an article about it before the release.
https://archive.is/IwKSP

At the same meeting, company leadership gave a demonstration of a research project involving its GPT-4 AI model that OpenAI thinks shows some new skills that rise to human-like reasoning, according to a person familiar with the discussion who asked not to be identified because they were not authorized to speak to press.
(Relevant, although “involving its GPT-4 AI model” is a considerably weaker statement than ‘initialized from a GPT-4 checkpoint’.)