O O comments on My model of what is going on with LLMs

O O 28 Feb 2025 17:47 UTC
1 point
0
I mean, I don’t want to give Big Labs any ideas, but I suspect the reasoning above implies that the o1/deepseek -style RL procedures might work a lot better if they can think internally for a long time
I expect gpt 5 to implement this. Based on recent research and how they phrase it.
- Cole Wyeth 28 Feb 2025 18:09 UTC
  2 points
  0
  Parent
  Yes, this is the type of idea big labs will definitely already have (also what I think ~100% of the time someone says “I don’t have to give big labs any ideas”).
  - williawa 28 Feb 2025 22:03 UTC
    1 point
    2
    Parent
    That’s what I also thought haha, else I wouldn’t post it.