How have your AGI timelines changed after this announcement?
~No update, priced it all in after the Q* rumors first surfaced in November 2023.
A rumor is not the same as a demonstration.
It is if you believe the rumor and can extrapolate its implications, which I did. Why would I need to wait to see the concrete demonstration that I’m sure would come, if I can instead update on the spot?
It wasn’t hard to figure out what “something like an LLM with A*/MCTS stapled on top” would look like, or where it’d shine, or that OpenAI might be trying it and succeeding at it (given that everyone in the ML community had already been exploring this direction at the time).
Suppose I flip a coin but don’t show you the outcome. Your friend’s cousin tells you they think the bias is 80⁄20 in favor of heads.
If I then show you that the outcome was indeed heads, should you still update? (Yes.)
Sure. But if you know the bias is 95⁄5 in favor of heads, and you see heads, you don’t update very strongly.
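(To put rough numbers on the coin analogy: here is a minimal Bayesian-update sketch, illustrative only; the 0.8 and 0.5 likelihoods are assumed values, not figures from this exchange. It shows that the higher your prior credence that the coin is heads-biased, the less a single observed heads moves that credence in absolute terms.)

```python
# Illustrative sketch: posterior credence that a coin is heads-biased after
# observing one heads, for different priors. Likelihoods are assumed values.

def posterior_biased(prior_biased, p_heads_if_biased=0.8, p_heads_if_fair=0.5):
    """Bayes' rule over two hypotheses: 'biased toward heads' vs 'fair'."""
    joint_biased = prior_biased * p_heads_if_biased
    joint_fair = (1 - prior_biased) * p_heads_if_fair
    return joint_biased / (joint_biased + joint_fair)

for prior in (0.50, 0.80, 0.95):
    print(f"prior {prior:.2f} -> posterior {posterior_biased(prior):.3f}")

# prior 0.50 -> posterior 0.615
# prior 0.80 -> posterior 0.865
# prior 0.95 -> posterior 0.968
```

(The Bayes factor for seeing heads is the same in every row; it’s the absolute shift that shrinks as the prior approaches 1, which is the sense in which seeing heads at 95⁄5 “doesn’t update very strongly.”)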
And yes, I was approximately that confident that something-like-MCTS was going to work, that it’d demolish well-posed math problems, and that this was the direction OpenAI would go in (after factoring in the rumor’s existence). The only question was the timing, and even that is mostly within my expectations.
That’s significantly outside the prediction intervals of forecasters, so I will need to see a Metaculus/Manifold/etc. account where you explicitly made this prediction, sir.
Fair! Except I’m not arguing that you should take my other predictions at face value on the basis of my supposedly having been right that one time. Indeed, I wouldn’t do that without just the sort of receipt you’re asking for! (Which I don’t have. The best I can do is a December 1, 2023 private message I sent to Zvi making correct predictions about what o1-o3 could be expected to be, but I don’t view those predictions as impressive, and the message notably lacks credences.)
I’m only countering your claim that no internally consistent version of me could have validly updated all the way here from November 2023. You’re free to assume that the actual version of me is dissembling or confabulating.
The coin coming up heads is “more headsy” than the expected outcome, but maybe o3 is about as headsy as Thane expected.
Like if you had flipped 100 coins and then revealed that 80 were heads.
I guess one’s timelines might have gotten longer if one had very high credence that the paradigm opened by o1 is a blind alley (relative to the goal of developing human-worker-omni-replacement-capable AI), but one profitable enough that OA gets distracted from its most ambitious official goal.
I’m not that person.
$100-200bn 5 GW training systems are now a go. So in the worlds where progress slows down for years because only $30bn systems are available and an additional scaling push would be needed, timelines moved up a few years. Not sure how unlikely $100-200bn systems would’ve been without o1/o3, but they seem likely now.
What do you think is the current cost of o3, for comparison?
In the same terms as the $100-200bn I’m talking about, o3 is probably about $1.5-5bn, meaning 30K-100K H100s: the kind of system needed to train GPT-4o or GPT-4.5o (or whatever they’ll call it), which o3 might be based on. But that’s the cost of the training system; the training time itself is cheaper (since the rest of the system’s time can be used for other things). In the other direction, it’s more expensive than just that time because of research experiments. If OpenAI spent $3bn in 2024 on training, that was probably mostly research experiments.
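(A back-of-envelope check on those figures, as a sketch only; the ~$50K all-in cost per H100 is an assumption chosen to be consistent with the range above, not a number from the comment.)

```python
# Assumption: roughly $50K per H100 all-in (GPU + networking + datacenter),
# chosen to be consistent with the $1.5-5bn range above.
cost_per_h100 = 50_000  # USD, assumed

for n_gpus in (30_000, 100_000):
    system_cost_bn = n_gpus * cost_per_h100 / 1e9
    print(f"{n_gpus:>7,} H100s -> ~${system_cost_bn:.1f}bn training system")

#  30,000 H100s -> ~$1.5bn training system
# 100,000 H100s -> ~$5.0bn training system
```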