Just as a specific prediction, does this mean you expect we will very substantially reduce the cheating/lying behavior of current RL models?
I disown this prediction as “mine”; it's more like the prediction of one facet of me. But yeah, that facet definitely expects to see visible improvements in the lying and cheating behavior of reasoning models over the next few years.
Better medical tech, better entertainment, various new technologies that start out as trivialities but quickly become essential to people’s lives (like the cell phone).