Module text_model


Contrastive Language-Image Pre-Training

Contrastive Language-Image Pre-Training (CLIP) is an architecture trained on pairs of images and their related texts.
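To make the idea of matching images with related texts concrete, here is a minimal sketch of how a contrastively trained model is typically used at inference time: embeddings of an image and of several candidate texts are compared by cosine similarity, and the closest text wins. This is an illustration only, not this module's API. The function names `cosine_similarity` and `best_matching_text` are hypothetical, and the embeddings are hand-written toy values rather than outputs of a real encoder.

```rust
/// Cosine similarity between two equal-length embedding vectors.
/// (Hypothetical helper for illustration; not part of this module.)
fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    assert_eq!(a.len(), b.len(), "embeddings must have the same dimension");
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    dot / (norm_a * norm_b)
}

/// Index of the text embedding most similar to the image embedding,
/// or `None` when no candidate texts are given.
fn best_matching_text(image: &[f32], texts: &[Vec<f32>]) -> Option<usize> {
    texts
        .iter()
        .map(|t| cosine_similarity(image, t))
        .enumerate()
        .max_by(|(_, a), (_, b)| a.total_cmp(b))
        .map(|(i, _)| i)
}

fn main() {
    // Toy, hand-written embeddings; a real model would produce these.
    let image = vec![1.0, 0.0, 0.5];
    let texts = vec![vec![0.0, 1.0, 0.0], vec![0.9, 0.1, 0.4]];
    let best = best_matching_text(&image, &texts);
    println!("best matching text index: {:?}", best);
}
```

In a real pipeline the text vectors would presumably come from a text encoder such as the structs listed below, and the image vector from a separate vision encoder; the sketch leaves both out.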

Structs§

ClipEncoder
ClipTextConfig
ClipTextTransformer
A CLIP transformer-based model.

Enums§

Activation