Improved versions of the Turing test seem like a natural place to start. We’ve probably learned more about what language models are capable of in the last two years (since the release of GPT-3) than in all previous years. The Feigenbaum test looks much better to me than the Loebner Silver Prize, for example.
Improved versions of the Turing test seem like a natural place to start. We’ve probably learned more about what language models are capable of in the last two years (since the release of GPT-3) than in all previous years. The Feigenbaum test looks much better to me than the Loebner Silver Prize, for example.