Expand description
Contrastive Language-Image Pre-Training
Contrastive Language-Image Pre-Training (CLIP) is an architecture trained on pairs of images with related texts.
Structs§
- Clip
Text Transformer - A CLIP transformer based model.
- Config
Contrastive Language-Image Pre-Training
Contrastive Language-Image Pre-Training (CLIP) is an architecture trained on pairs of images with related texts.