The ability to complete sequences is equivalent to prediction. The way GPT-3 completes sequences is that it predicts what the next token will be and then outputs the prediction. The same approach can be applied to images.
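A minimal toy sketch of that loop, with a bigram counter standing in for the learned model (GPT-3 does the same thing at vastly larger scale, with a neural network in place of the counts):

```python
from collections import Counter, defaultdict

def train_bigram(tokens):
    """Count which token tends to follow which (a stand-in for a learned model)."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts

def predict_next(counts, context):
    """Predict the most likely next token given the last token of the context."""
    followers = counts.get(context[-1])
    return followers.most_common(1)[0][0] if followers else None

corpus = "the cat sat on the mat and the cat slept".split()
model = train_bigram(corpus)

completion = ["the"]
for _ in range(5):
    nxt = predict_next(model, completion)
    if nxt is None:
        break
    completion.append(nxt)  # output the prediction, then feed it back in
print(" ".join(completion))
```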
In general, the agent, based on all of its input data up to some point, tries to generate future data. If it can predict its own input data reliably that means it has a model of the world which is similar to reality. This is similar to Solomonoff induction.
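Concretely, Solomonoff induction weights every program that reproduces the data seen so far, favoring shorter programs. The universal prior it assigns to a prefix $x$ is

$$M(x) = \sum_{p \,:\, U(p) = x*} 2^{-\ell(p)},$$

where $U$ is a universal monotone Turing machine, the sum runs over programs $p$ whose output begins with $x$, and $\ell(p)$ is the length of $p$ in bits. Prediction is then just conditioning: the probability that the next symbol is $b$ is $M(xb)/M(x)$.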
Once you have a good approximation of Solomonoff induction (which is uncomputable), you combine the approximation (somehow) with reinforcement learning and expected utility maximization and get an approximation of AIXI.
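For reference, Hutter’s AIXI equation makes that combination explicit: an expectimax over future actions, observations, and rewards, with a Solomonoff-style $2^{-\ell(q)}$ prior over environment programs $q$ supplying the world-model,

$$a_k := \arg\max_{a_k} \sum_{o_k r_k} \cdots \max_{a_m} \sum_{o_m r_m} \left[ r_k + \cdots + r_m \right] \sum_{q \,:\, U(q, a_1 \ldots a_m) = o_1 r_1 \ldots o_m r_m} 2^{-\ell(q)},$$

where the $a$’s are actions, the $o$’s and $r$’s are observations and rewards, and $m$ is the horizon.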
Since I’m not an expert in reinforcement learning I’m not sure which part is harder, but my intuition is that the hard part of all of this would be approximating Solomonoff induction; once you have a good world-model, it seems relatively straightforward to maximize utility. I hope I’m wrong. (If you think I am, please explain why.)
See FAQ #4 on MIRI’s website below.
Edit: It was written in 2013, so it is probably best viewed as a jumping-off point from which you can make further updates based on what has happened in the world since then.
Yes, but GPT-3 offers us new evidence we should try to update on. It’s debatable how many bits of evidence it provides, but we can also update based on this post, Discontinuous progress in history: an update:
AlphaGo was something we saw before we expected it. The GPT-3 text generator was something we saw before we expected it. They were discontinuities.
I agree. I’m not sure how much to update on the things you mention, or on other things that have happened since 2013, so I think my answer serves more as a jumping-off point than as something authoritative. I edited it to mention that.
1. The problem with theories in the vein of AIXI is that they assume exploration is simple (as it is in RL), but exploration is very expensive IRL.
So if you want to think within that framework, AGI is as far away as it takes to build a robust simulation of the world in which we want it to operate (very far away).
2. In the world of mortals, I would say AGI is basically already here, but it’s not obvious because its impact is not that great.
We have ML-based systems that could in theory do almost any job; the real problem is that they are much more expensive than humans to “get right”, and in some cases (e.g. self-driving) there are regulatory hurdles to cross.
The main problem with a physical human-like platform running an AGI is not that designing the algorithms for it to perform useful tasks is hard; the problem is that designing a human-like platform is impossible with current technology, and the closest alternatives we’ve got are still more expensive to build and maintain than just hiring a human.
That’s why companies are buying checkout machines to replace employees rather than checkout robots.
3. If you’re referring to “superintelligence”-style AGI, i.e. something that is much more intelligent than a human, I’d argue we can’t tell how far away that is or whether it can even exist (i.e. I think it’s not obvious that the bottleneck at the moment is intelligence rather than physical limitations; see point 1, plus corrupt incentive structures, aka why smart humans are still not always used to their full potential).
I’m not sure what you mean by this. Does RL mean reinforcement learning, while IRL means “in real life”? AIXI would be very efficient, using the minimum possible exploration. (And a lot of exploration can be done cheaply. There is a lot of data online that can be downloaded for the cost of bandwidth, and sending a network packet to see what you get back is exploration.)
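As an illustration of how cheap that kind of exploration can be, here is a minimal sketch (the URL is a placeholder, not from the original comment): one request goes out, and whatever comes back is treated as an observation.

```python
import urllib.request

def explore(url):
    """Send one request and treat whatever comes back as an observation."""
    with urllib.request.urlopen(url, timeout=5) as response:
        return response.read()

observation = explore("https://example.com")  # placeholder URL
print(f"{len(observation)} bytes observed for the cost of one round trip")
```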