It turns out publishing bias is one heck of a drug. Every success of automation was touted, and every failure quietly tucked away, until one day the successes started getting smaller and less significant. We still see improvements around the edges of capability, but the big rocks, like making choices between domains in pursuit of abstract goals, remain elusive.
Making choices between domains in pursuit of abstract goals:
Say I have an agent with the goal of “win $ in online poker” and read/write access to the internet. Obviously that agent will simulate millions of games, and play thousands of hands online to learn more about poker and get better.
What I don’t expect to ever see (without explicit coding by a human) is that “win $ at poker” AI looking up instructional youtube videos to learn from human experts, or telling its handlers to set up additional hardware for it, or writing child AI programs with different strategies and having them play against each-other, or trading crypto during a poker game because that is another way to “win $,” or even coding and launching a new poker playing website.
I would barely expect it to find new sites where it could play, and be able to join those sites.
It turns out publishing bias is one heck of a drug. Every success of automation was touted, and every failure quietly tucked away, until one day the successes started getting smaller and less significant. We still see improvements around the edges of capability, but the big rocks, like making choices between domains in pursuit of abstract goals, remain elusive.
Hmm, can you elaborate on what you mean in the last sentence?
Making choices between domains in pursuit of abstract goals:
Say I have an agent with the goal of “win $ in online poker” and read/write access to the internet. Obviously that agent will simulate millions of games, and play thousands of hands online to learn more about poker and get better. What I don’t expect to ever see (without explicit coding by a human) is that “win $ at poker” AI looking up instructional youtube videos to learn from human experts, or telling its handlers to set up additional hardware for it, or writing child AI programs with different strategies and having them play against each-other, or trading crypto during a poker game because that is another way to “win $,” or even coding and launching a new poker playing website. I would barely expect it to find new sites where it could play, and be able to join those sites.