Skip to main content

Module onnx

Module onnx 

Source
Expand description

ONNX Runtime embedding backend.

Loads a local ONNX model (all-MiniLM-L6-v2 quantized) and provides embedding inference. Supports automatic model download and caching.

Structsยง

OnnxEmbedder
ONNX-based embedding backend using all-MiniLM-L6-v2.