It seems to do something similar to Gato, where everything is just serialized into tokens, which is pretty cool.
I wonder if they are just using a standard transformer for everything, or some sort of diffusion model for the images inside the model?
What does it mean for perception to compress a frame of video to 1k tokens? What kind of information gets lost when you do this?
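For a sense of the arithmetic, here's a minimal sketch of how a frame could end up as roughly 1k discrete tokens via patch-based quantization. The frame resolution (512x512), patch size (16x16), and codebook size (8192) are assumptions chosen to make the numbers concrete, not details from the paper, and the "encoder" is a placeholder rather than a learned tokenizer.

```python
import numpy as np

FRAME_H, FRAME_W = 512, 512          # assumed input resolution
PATCH = 16                           # assumed patch size (ViT/VQ-style)
CODEBOOK_SIZE = 8192                 # assumed number of discrete codes

def tokenize_frame(frame: np.ndarray) -> np.ndarray:
    """Map an (H, W, 3) uint8 frame to a grid of discrete token ids.

    The quantizer here is a stand-in (patch mean hashed to a code id);
    a real tokenizer would use a learned VQ-VAE / ViT encoder.
    """
    h_tokens = FRAME_H // PATCH      # 32
    w_tokens = FRAME_W // PATCH      # 32
    tokens = np.zeros((h_tokens, w_tokens), dtype=np.int64)
    for i in range(h_tokens):
        for j in range(w_tokens):
            patch = frame[i*PATCH:(i+1)*PATCH, j*PATCH:(j+1)*PATCH]
            # Placeholder quantization: map the patch mean into the codebook.
            tokens[i, j] = int(patch.mean() * 31) % CODEBOOK_SIZE
    return tokens

frame = np.random.randint(0, 256, (FRAME_H, FRAME_W, 3), dtype=np.uint8)
tokens = tokenize_frame(frame)
print(tokens.shape, tokens.size)     # (32, 32) -> 1024 tokens per frame

# What gets lost: each 16x16x3 patch (768 bytes of pixels) collapses to one
# id (~13 bits with an 8192-entry codebook), so fine texture, small text,
# and sub-patch detail are discarded by the compression.
```

Under these assumptions, each token summarizes a 16x16 pixel region, so the answer to "what gets lost" is mostly anything smaller than a patch: fine texture, small on-screen text, and subtle per-pixel differences between frames.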