Deep chain-of-thought reasoning and mathematical reasoning are among its weak points. Are the models able to form good enough abstractions internally to resolve arbitrarily long (even if not complex) math/logical problems?
I think this is a key question. I think the answer for transformer LLMs without an external memory store is currently ‘No, not arbitrarily long computations’. I have looked into this in my research and found some labs doing work that would enable this, so it’s more a question of when the big labs will decide to integrate these ideas from academic groups into SoTA models, not a question of whether it’s possible. There are a lot of novel capabilities like this that have been demonstrated to be possible but not yet integrated, even after trying to rule out those that might be blocked by poor scaling or parallelizability.
So the remaining hurdles seem to be more about engineering solutions to integration challenges than about innovation and proof-of-concept.
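To make ‘arbitrarily long (even if not complex)’ concrete, here’s a minimal sketch of the kind of length-generalization probe I have in mind: a parity task where every individual step is trivial, but the number of steps grows without bound. The `query_model` function is a hypothetical stub (not any particular API); swap in whatever model call you like.

```python
import random

def make_parity_instance(n_bits: int) -> tuple[str, str]:
    """Build one 'long but simple' problem: parity of an n-bit string.
    Each step is trivial; only the number of steps grows with n_bits."""
    bits = [random.randint(0, 1) for _ in range(n_bits)]
    prompt = (
        "Is the number of 1s in the following bit string even or odd? "
        + " ".join(map(str, bits))
    )
    answer = "even" if sum(bits) % 2 == 0 else "odd"
    return prompt, answer

def query_model(prompt: str) -> str:
    """Hypothetical stub: replace with a real chat/completion API call."""
    raise NotImplementedError

def accuracy_at_length(n_bits: int, n_trials: int = 20) -> float:
    """Estimate accuracy at a fixed problem length."""
    correct = 0
    for _ in range(n_trials):
        prompt, answer = make_parity_instance(n_bits)
        try:
            reply = query_model(prompt)
        except NotImplementedError:
            return float("nan")  # no model wired up in this sketch
        correct += answer in reply.lower()
    return correct / n_trials

if __name__ == "__main__":
    # With fixed compute per token and no external memory or reliably
    # used scratchpad, accuracy typically degrades as n_bits grows past
    # the lengths seen in training.
    for n in (8, 64, 512, 4096):
        print(n, accuracy_at_length(n))
```

The reason to sweep `n_bits` is that ‘arbitrarily long’ requires the accuracy curve to stay flat as length grows; in practice it tends to fall off somewhere, which is exactly the gap the external-memory/integration work is aimed at.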
Let’s hope not.