First it was hands. Then it was text, and multi-element composition. What can we still not do with image generation?
Text generation is considerably better, but still limited to few words, maybe few sentences. Ask it to generate you a monitor with Python code on it and you’ll see current limitations of this. It is an improvement for sure but in no way “solved” task.
Text generation is considerably better, but still limited to few words, maybe few sentences. Ask it to generate you a monitor with Python code on it and you’ll see current limitations of this. It is an improvement for sure but in no way “solved” task.