Module clip

Module clip 

Source
Expand description

Contrastive Language-Image Pre-Training

Contrastive Language-Image Pre-Training (CLIP) is an architecture trained on pairs of images with related texts.

Modules§

text_model
Contrastive Language-Image Pre-Training
vision_model
Contrastive Language-Image Pre-Training

Structs§

ClipConfig
ClipModel

Enums§

EncoderConfig

Functions§

div_l2_norm