Peter Wildeford comments on What’s the Least Impressive Thing GPT-4 Won’t be Able to Do

Peter Wildeford 22 Aug 2022 23:44 UTC
12 points
5
This is admittedly pretty trivial, but I am 90% sure that if you prompt GPT-4 with “Q: What is today’s date?” it will not answer correctly. I think something like this would literally be the least impressive thing that GPT-4 won’t be able to do.
- gwern 23 Aug 2022 1:34 UTC
  9 points
  2
  Are you really 90% sure on that? For example, LaMDA apparently has live web query access (a direction OA was also exploring with WebGPT), and could easily recognize that as a factual query worth a web query, and if you search Google for “what is today’s date?” it will of course spit back “Monday, August 22, 2022”, which even the stupidest LMs could make good use of. So your prediction would appear to boil down to “OA won’t do an obviously useful thing they already half-did and a competitor did do a year ago”.
  - Peter Wildeford 23 Aug 2022 2:49 UTC
    3 points
    0
    Yeah ok 80%. I also do concede this is a very trivial thing, not like some “gotcha look at what stupid LMs can’t do no AGI until 2400”.
  - Lech Mazur 23 Aug 2022 2:06 UTC
    1 point
    −1
    Well, if we’re counting things like that, this thread becomes much less interesting. They can offload math queries to Mathematica-like software, or chess playing to a bundled Stockfish, but we already know software can do this, and we’re interested in novel capabilities of language or multi-modal models.
    - gwern 23 Aug 2022 16:33 UTC
      7 points
      1
      The difference is that LaMDA/WebGPT are learning autonomously to make general use of tools (or tool AIs) provided to them as agent AIs, which is much more useful than giant piles of human-hand-engineered heuristics like in toys like Alexa or Wolfram Alpha. In my example, no one would have programmed it to know it should query Google for the current date; it has learned to exploit Google’s various features on its own, which is no more illegitimate than learning to call the date command (or a human learning to look at a clock, for that matter), and will extend to any other tools provided to it, like a Python REPL in inner-monologue work.
      - Lech Mazur 24 Aug 2022 4:10 UTC
        1 point
        0
        Sure, it would be useful, especially if they’re gunning for it to become a general chatbot assistant to take on Alexa or Google Home. But recognizing a factual query and offloading it to Google has been done by these assistants for years, and it’s not something that anybody would find impressive anymore, even if the classifier were part of a larger net.
    - Lone Pine 23 Aug 2022 9:19 UTC
      2 points
      0
      The ability for the AI to use tools like that is both impressive and useful, though.
- Matt Goldenberg 15 Mar 2023 3:11 UTC
  2 points
  0
  AFAICT OpenAI now includes the current date in the prompt, so I think this is wrong.
  - Peter Wildeford 15 Mar 2023 13:27 UTC
    2 points
    0
    Yep! I was wrong and this is false!
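
For readers wondering what "includes the current date in the prompt" could look like in practice, here is a minimal sketch. The thread does not say how OpenAI actually formats this, so the function name `build_prompt`, the "Current date:" preamble wording, and the Q/A layout are all assumptions made for illustration only.

```python
from datetime import datetime, timezone
from typing import Optional


def build_prompt(question: str, now: Optional[datetime] = None) -> str:
    """Prepend the current date to a question before it reaches the model.

    Hypothetical helper: the preamble format is an assumption, not a
    documented OpenAI convention.
    """
    if now is None:
        # Use the wall-clock time in UTC when no timestamp is supplied.
        now = datetime.now(timezone.utc)
    # Weekday and month names follow the default (C) locale.
    preamble = f"Current date: {now.strftime('%A, %B %d, %Y')}"
    return f"{preamble}\n\nQ: {question}\nA:"


if __name__ == "__main__":
    # Fixed timestamp matching the date quoted earlier in the thread.
    example = datetime(2022, 8, 22, tzinfo=timezone.utc)
    print(build_prompt("What is today's date?", example))
```

With the date placed in context this way, the model only has to copy it back, which is why the original prediction stops being a test of the model itself.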