Relative difficulty of making Tool-AI, Oracle-AI, Agent-like AI : In the critical period, is the technique that produce human-level competence explicitly optimizing a reward (like in Reinforcement Learning) or is it more like GPT-3, simply outputting the most likely sequence of characters and stops there?
Further technological progress with a tool-AI still depends on human-AI collaboration, and hence this could lead to slower take-off, an agent-like AI won’t necessarily stop to leave time for the humans to think.
Relative difficulty of making Tool-AI, Oracle-AI, Agent-like AI : In the critical period, is the technique that produce human-level competence explicitly optimizing a reward (like in Reinforcement Learning) or is it more like GPT-3, simply outputting the most likely sequence of characters and stops there?
Further technological progress with a tool-AI still depends on human-AI collaboration, and hence this could lead to slower take-off, an agent-like AI won’t necessarily stop to leave time for the humans to think.