I mean, I don’t want to give Big Labs any ideas, but I suspect the reasoning above implies that o1/DeepSeek-style RL procedures might work a lot better if the models can think internally for a long time.
I expect GPT-5 to implement this, based on recent research and how they phrase it.
Yes, this is the type of idea big labs will definitely already have (which is also what I think ~100% of the time someone says “I don’t want to give big labs any ideas”).
That’s what I thought too, haha; otherwise I wouldn’t have posted it.