Module clip_executor

Expand description

CLIP Model Executor for multimodal embeddings.

Supports both text and image embedding via unified interface. Text goes through CLIP text encoder, images through vision encoder.