Struct Embedder

Source

pub struct Embedder { /* private fields */ }

Expand description

Text embedding generator using nomic-embed-text-v1.5

Automatically downloads the model from HuggingFace Hub on first use. Detects GPU availability and uses CUDA/TensorRT when available.

§Example

use cqs::Embedder;

let mut embedder = Embedder::new()?;
let embedding = embedder.embed_query("parse configuration file")?;
println!("Embedding dimension: {}", embedding.len()); // 768

Implementations§

Source §

impl Embedder

Source

pub fn new() -> Result<Self, EmbedderError>

Create a new embedder, downloading the model if necessary

Automatically detects GPU and uses CUDA/TensorRT when available. Falls back to CPU if no GPU is found.

Note: ONNX session is lazy-loaded on first embedding request (~500ms).

Source

pub fn new_cpu() -> Result<Self, EmbedderError>

Create a CPU-only embedder

Use this for single-query embedding where CPU is faster than GPU due to CUDA context setup overhead. GPU only helps for batch embedding.

Source

pub fn token_count(&self, text: &str) -> Result<usize, EmbedderError>

Count tokens in a text

Source

pub fn split_into_windows( &self, text: &str, max_tokens: usize, overlap: usize, ) -> Result<Vec<(String, u32)>, EmbedderError>

Split text into overlapping windows of max_tokens with overlap tokens of context. Returns Vec of (window_content, window_index). If text fits in max_tokens, returns single window with index 0.

Source

pub fn embed_documents( &mut self, texts: &[&str], ) -> Result<Vec<Embedding>, EmbedderError>

Embed documents (code chunks). Adds “passage: “ prefix for E5.

Source

pub fn embed_query(&mut self, text: &str) -> Result<Embedding, EmbedderError>

Embed a query. Adds “query: “ prefix for E5. Uses LRU cache for repeated queries.

Source

pub fn provider(&self) -> ExecutionProvider

Get the execution provider being used

Source

pub fn batch_size(&self) -> usize

Get the batch size

Source

pub fn warm(&mut self) -> Result<(), EmbedderError>

Warm up the model with a dummy inference

Auto Trait Implementations§

§

impl UnwindSafe for Embedder

Blanket Implementations§

Source §

impl<T> Any for T
where T: 'static + ?Sized,

Source §

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

Source §

impl<T> Borrow<T> for T
where T: ?Sized,

Source §

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

Source §

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source §

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

Source §

impl<T> From<T> for T

Source §

fn from(t: T) -> T

Returns the argument unchanged.

Source §

impl<T> Instrument for T

Source §

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more

Source §

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more

Source §

impl<T, U> Into for T
where U: From<T>,

Source §

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source §

impl<T> IntoEither for T

Source §

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more

Source §

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more

Source §