A new kind of overhang might be brewing: a scaling overhang, where the optimization power of AI training grows ever greater without the guidance of aligned agency, increasing the risk that shoggoths wake up. This is distinct from progress in capabilities. Right now, there are increasingly intelligent human-like simulacra, but they don’t have an opportunity to act (or, more to the point, study) autonomously, and so can’t work toward preventing inhuman mesa-optimizers from emerging in future models, including their own. Figuring out how to give them more agency might therefore end up a positive change, provided it happens before any actual inhuman agentic mesa-optimizers are running around.