“Fundamentally incapable” is perhaps putting it too strongly, given that the Reflexion paper and other work from just the past two weeks show humans figuring out ways around this issue via things like reflection/iterative prompting:
https://nanothoughts.substack.com/p/reflecting-on-reflexion
https://arxiv.org/abs/2303.11366
Using this relatively simple approach, GPT-4 jumps from 67% to 88% correct on the HumanEval benchmark.
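For anyone who hasn't read the paper yet, the core idea is roughly the loop sketched below (a minimal sketch, not the authors' actual code; `call_llm` and `run_unit_tests` are placeholder names standing in for a chat-model API call and a test harness):

```python
def call_llm(prompt: str) -> str:
    """Stub: replace with a real chat-model call (assumed interface)."""
    raise NotImplementedError

def run_unit_tests(code: str) -> tuple[bool, str]:
    """Stub: run generated code against tests, return (passed, feedback)."""
    raise NotImplementedError

def reflexion_solve(task: str, max_iters: int = 3) -> str:
    """Generate code, test it, and feed failures back as verbal 'reflections'."""
    reflections: list[str] = []
    code = call_llm(f"Write a Python function for this task:\n{task}")
    for _ in range(max_iters):
        passed, feedback = run_unit_tests(code)
        if passed:
            break
        # Ask the model to critique its own failed attempt in natural language...
        reflections.append(call_llm(
            f"Task:\n{task}\n\nYour code:\n{code}\n\n"
            f"Test feedback:\n{feedback}\n\n"
            "In a few sentences, explain what went wrong and how to fix it."
        ))
        # ...then retry with the accumulated reflections included in the prompt.
        code = call_llm(
            f"Task:\n{task}\n\nPrevious attempt:\n{code}\n\n"
            "Reflections on past failures:\n" + "\n".join(reflections) +
            "\n\nWrite an improved version of the function."
        )
    return code
```

The point is that no retraining is involved: the "enhancement" is just scaffolding that loops the model's own output and test feedback back into its prompt.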
So I believe the lesson is: “limitations” in LLMs may turn out to be fairly easily engineered away by clever human helpers. Therefore IMO, any judgment of whether a particular LLM is dangerous must also take into account the likely ways humans will build additional tech onto/around it to enhance it.