Nice quick check!
Just to be clear: This is for the actual full models? Or for the ‘model embeddings’ as in you’re doing a comparison right after the embedding layer?
This is for the full models—I simply used both models on replicate and gave one image and two text labels as input: CLIP, SigLIP
Nice quick check!
Just to be clear: This is for the actual full models? Or for the ‘model embeddings’ as in you’re doing a comparison right after the embedding layer?
This is for the full models—I simply used both models on replicate and gave one image and two text labels as input: CLIP, SigLIP