From what I’ve seen so far, Imagen is more “straightforward” and does a better job generating an image describing the text than DALE-2. But DALE-2 seems to be producing prettier images (which makes sense given it was fine-tuned for aesthetics),
There’s a Github repo up already, so I hope we’ll be able to try an Open source version and actually test on the same prompts as DALE-2.
From what I’ve seen so far, Imagen is more “straightforward” and does a better job generating an image describing the text than DALE-2. But DALE-2 seems to be producing prettier images (which makes sense given it was fine-tuned for aesthetics),
There’s a Github repo up already, so I hope we’ll be able to try an Open source version and actually test on the same prompts as DALE-2.
It’ll be interesting to see Imagen fine-tuned on laion aesthetic