If you think about it, the right way to “do” this would be to internally render the text in a terminal with the same width as the ChatGPT text window (or a standard terminal width), generate an image from that, and then process it as an image.
That’s literally what you are doing when you manually align the verticals and look.
GPT-4 is not architecturally doing that; it simply lacks the capability. But we can trivially see a Toolformer-style version of it that could decide to feed the input stream to a simulated terminal, pass the rendered output to a vision module, and process the result, and that version would be able to solve it.
Without actually making the core LLM any smarter, just giving it more peripherals.
There’s a bunch of stuff like that, where you realize the underlying LLM is perfectly capable of doing it but is currently just missing the peripheral.
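Just to make the “simulated terminal” peripheral concrete, here is a minimal sketch of what that rendering step might look like, assuming a Pillow-based renderer, a fixed 80-column width, and a DejaVu monospace font; these are illustrative choices, not anything GPT-4 actually does internally:

```python
# Render raw text into a fixed-width "terminal" image that a vision
# module could then inspect for vertical alignment, column structure, etc.
# Assumes Pillow is installed and DejaVuSansMono.ttf is available locally.
from PIL import Image, ImageDraw, ImageFont

def render_terminal(text: str, cols: int = 80, font_size: int = 14) -> Image.Image:
    font = ImageFont.truetype("DejaVuSansMono.ttf", font_size)
    # Hard-wrap each line to the terminal width, preserving explicit newlines.
    lines = []
    for line in text.splitlines() or [""]:
        while len(line) > cols:
            lines.append(line[:cols])
            line = line[cols:]
        lines.append(line)
    char_w = font.getbbox("M")[2]   # cell width of the monospace font
    line_h = font_size + 4          # simple fixed line height
    img = Image.new("RGB", (cols * char_w + 20, len(lines) * line_h + 20), "black")
    draw = ImageDraw.Draw(img)
    for i, line in enumerate(lines):
        draw.text((10, 10 + i * line_h), line, font=font, fill="white")
    return img

# img = render_terminal(ascii_table)
# img.save("terminal.png")  # hand this image to a vision module instead of raw tokens
```

The point isn’t this particular code, it’s that the whole peripheral is a few dozen lines of glue sitting outside the model.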