CLIP Model Executor for multimodal embeddings.
Supports both text and image embedding via a unified interface. Text goes through the CLIP text encoder; images go through the vision encoder.
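The routing described above can be sketched as a single entry point that dispatches on the input kind. This is a minimal illustration, not this crate's actual API: `EmbedInput`, `Route`, `ClipSketch`, `toy_encode`, and `DIM` are hypothetical names, and the "encoders" are toy byte-folding stand-ins for the real CLIP text and vision towers. The only point is that both modalities enter through one `embed` call and land in the same fixed-size, L2-normalized vector space.

```rust
/// Hypothetical input type: one enum covers both modalities.
enum EmbedInput<'a> {
    Text(&'a str),
    Image(&'a [u8]),
}

/// Which encoder handled the input (returned here only for illustration).
#[derive(Debug, PartialEq)]
enum Route {
    TextEncoder,
    VisionEncoder,
}

/// Toy embedding width; real CLIP models use a much larger dimension.
const DIM: usize = 4;

/// Hypothetical executor exposing one entry point for text and images.
struct ClipSketch;

impl ClipSketch {
    /// Dispatch on the input kind, then normalize so both modalities
    /// share one unit-length embedding space.
    fn embed(&self, input: EmbedInput<'_>) -> (Route, Vec<f32>) {
        match input {
            EmbedInput::Text(text) => {
                (Route::TextEncoder, l2_normalize(toy_encode(text.as_bytes())))
            }
            EmbedInput::Image(bytes) => {
                (Route::VisionEncoder, l2_normalize(toy_encode(bytes)))
            }
        }
    }
}

/// Stand-in for a real encoder: folds bytes into a DIM-length vector.
fn toy_encode(bytes: &[u8]) -> Vec<f32> {
    let mut v = vec![0.0f32; DIM];
    for (i, b) in bytes.iter().enumerate() {
        v[i % DIM] += *b as f32;
    }
    v
}

/// Scale a vector to unit L2 norm (an all-zero vector is left unchanged).
fn l2_normalize(mut v: Vec<f32>) -> Vec<f32> {
    let norm = v.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm > 0.0 {
        for x in v.iter_mut() {
            *x /= norm;
        }
    }
    v
}

/// Cosine similarity of two unit vectors reduces to their dot product.
fn cosine(a: &[f32], b: &[f32]) -> f32 {
    a.iter().zip(b).map(|(x, y)| x * y).sum()
}
```

Under these assumptions, a caller would write `ClipSketch.embed(EmbedInput::Text("a photo of a cat"))` or `ClipSketch.embed(EmbedInput::Image(&bytes))` and compare the resulting vectors with `cosine`, without needing to know which encoder ran.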
Structs
- Clip
Model Executor - CLIP executor for text and image embeddings.