Agreement karma indicates agreement, separate from overall quality.
If RL becomes the next thing in improving LLM capabilities, one thing that I would bet on becoming big is computer-use in 2025. Seems hard to get more intelligence with just RL (who verifies the outputs?), but with something like computer use, it’s easy to verify if a task has been done (has the email been sent, ticket been booked etc..) that it’s starting to look to more to me like it can do self-learning.
One thing that’s left AI still fully not integrated into the rest of the economy is simply that the current interfaces were built for humans and moving all those takes engineering time / effort etc.
I’m fairly sure the economic disruption would be pretty quick once this happens. For example, I can just run 10 LLM agents to act as customer service agents using my *existing tools* - just open emails, whatsapp, and message customers, check internal dashboards etc., then it’s game over. What’s stopping people right now is that there’s not enough people to build that pipeline fast enough to utilize even the current capabilities.
2 votes
Overall karma indicates overall quality.
0 votes
Agreement karma indicates agreement, separate from overall quality.
If RL becomes the next thing in improving LLM capabilities, one thing that I would bet on becoming big is computer-use in 2025. Seems hard to get more intelligence with just RL (who verifies the outputs?), but with something like computer use, it’s easy to verify if a task has been done (has the email been sent, ticket been booked etc..) that it’s starting to look to more to me like it can do self-learning.
One thing that’s left AI still fully not integrated into the rest of the economy is simply that the current interfaces were built for humans and moving all those takes engineering time / effort etc.
I’m fairly sure the economic disruption would be pretty quick once this happens. For example, I can just run 10 LLM agents to act as customer service agents using my *existing tools* - just open emails, whatsapp, and message customers, check internal dashboards etc., then it’s game over. What’s stopping people right now is that there’s not enough people to build that pipeline fast enough to utilize even the current capabilities.