It would likely depend on whether or not self-driving cars and AI doctors need some form of reinforcement learning to work. If they do, and especially if they need to use online learning, then presumably they will need to at least partially solve issues like safe exploration, distributional shift, avoiding side effects, verification and validation of RL policies, etc. It also seems likely that they would need to solve versions of specification gaming to ensure that the RL agent doesn’t do weird things in edge cases because the reward function wasn’t perfectly specified. Whether or not such partial solutions would scale up to AGI is a different discussion, as I mentioned.
It would likely depend on whether or not self-driving cars and AI doctors need some form of reinforcement learning to work. If they do, and especially if they need to use online learning, then presumably they will need to at least partially solve issues like safe exploration, distributional shift, avoiding side effects, verification and validation of RL policies, etc. It also seems likely that they would need to solve versions of specification gaming to ensure that the RL agent doesn’t do weird things in edge cases because the reward function wasn’t perfectly specified. Whether or not such partial solutions would scale up to AGI is a different discussion, as I mentioned.