It would be cool to try some style-matching between the text and images. Ultimately, having some “personality vector” which would be used both in image and text generation. (A very crude version could be to create a NN translator from the style space to word2vec space and include the words in the GPT prompts)
It would be cool to try some style-matching between the text and images. Ultimately, having some “personality vector” which would be used both in image and text generation. (A very crude version could be to create a NN translator from the style space to word2vec space and include the words in the GPT prompts)