EmbeddingModel

Enum EmbeddingModel 

Source
pub enum EmbeddingModel {
    BgeLarge,
    MiniLM,
    BgeSmall,
    E5Small,
    ModernBertEmbedBase,
}
Supported embedding models.

Variants§

§

BgeLarge

BAAI/bge-large-en-v1.5 - Highest quality, 1024 dimensions (default)

  • Dimensions: 1024
  • Max tokens: 512
  • Speed: Slower than small models, but highest quality
§

MiniLM

all-MiniLM-L6-v2 - Fast and efficient, good for general use

  • Dimensions: 384
  • Max tokens: 256
  • Speed: Fastest
§

BgeSmall

BAAI/bge-small-en-v1.5 - Balanced quality and speed

  • Dimensions: 384
  • Max tokens: 512
  • Speed: Medium
§

E5Small

intfloat/e5-small-v2 - Higher quality embeddings

  • Dimensions: 384
  • Max tokens: 512
  • Speed: Medium
§

ModernBertEmbedBase

nomic-ai/modernbert-embed-base - Modern transformer with 8192-token context

  • Dimensions: 768 (native Matryoshka: 768/512/256/128/64)
  • Max tokens: 8192
  • Speed: 25% faster than BgeLarge on the same hardware
  • Flash Attention support for long sequences
  • Env var: DAKERA_MODEL=modernbert-embed-base
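
The variant properties listed above can be compared programmatically. The following is a minimal sketch, not the crate's own code: `ModelSketch` is a local stand-in enum that copies only the documented dimensions and token limits, and `models_fitting` is a hypothetical helper introduced for illustration.

```rust
/// Local stand-in mirroring the documented variants (hypothetical, not the crate's type).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum ModelSketch {
    BgeLarge,
    MiniLM,
    BgeSmall,
    E5Small,
    ModernBertEmbedBase,
}

impl ModelSketch {
    /// Embedding dimension, copied from the variant descriptions.
    fn dimension(self) -> usize {
        match self {
            ModelSketch::BgeLarge => 1024,
            ModelSketch::MiniLM | ModelSketch::BgeSmall | ModelSketch::E5Small => 384,
            ModelSketch::ModernBertEmbedBase => 768,
        }
    }

    /// Maximum input length in tokens, copied from the variant descriptions.
    fn max_seq_length(self) -> usize {
        match self {
            ModelSketch::MiniLM => 256,
            ModelSketch::BgeLarge | ModelSketch::BgeSmall | ModelSketch::E5Small => 512,
            ModelSketch::ModernBertEmbedBase => 8192,
        }
    }
}

/// Hypothetical helper: every model whose context window holds `tokens` without truncation.
fn models_fitting(tokens: usize) -> Vec<ModelSketch> {
    const ALL: [ModelSketch; 5] = [
        ModelSketch::BgeLarge,
        ModelSketch::MiniLM,
        ModelSketch::BgeSmall,
        ModelSketch::E5Small,
        ModelSketch::ModernBertEmbedBase,
    ];
    ALL.iter()
        .copied()
        .filter(|m| m.max_seq_length() >= tokens)
        .collect()
}
```

With the real enum, the same filter could presumably be written over `EmbeddingModel::all()` using `max_seq_length()`.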

Implementations§

Source§

impl EmbeddingModel

Source

pub fn model_id(&self) -> &'static str

Get the HuggingFace model ID.

Source

pub fn dimension(&self) -> usize

Get the embedding dimension for this model.

Source

pub fn max_seq_length(&self) -> usize

Get the maximum sequence length (in tokens).

Source

pub fn mrl_dimensions(&self) -> Option<&'static [usize]>

Get the Matryoshka-supported dimensions for this model (smallest to largest).

Returns None for models that do not support MRL truncation.
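
As an illustration of what MRL truncation usually involves, the sketch below keeps the leading components of a full embedding and re-normalizes them. It assumes the common Matryoshka convention (truncate, then L2-normalize); `truncate_mrl` is a hypothetical helper, and how the crate itself performs truncation is not stated here.

```rust
/// Truncate `embedding` to `target` dimensions and L2-normalize the result.
/// Returns `None` when `target` is not one of the `supported` sizes
/// or is larger than the input vector.
fn truncate_mrl(embedding: &[f32], target: usize, supported: &[usize]) -> Option<Vec<f32>> {
    if !supported.contains(&target) || target > embedding.len() {
        return None;
    }
    // Keep only the leading `target` components.
    let mut out = embedding[..target].to_vec();
    // Re-normalize so the shortened vector has unit length again.
    let norm = out.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm > 0.0 {
        for x in &mut out {
            *x /= norm;
        }
    }
    Some(out)
}
```

The `supported` slice would typically come from `mrl_dimensions()`, for example `[64, 128, 256, 512, 768]` for ModernBertEmbedBase per the variant description.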

Source

pub fn safetensors_filename(&self) -> &'static str

Get the safetensors model filename (for Candle backend).

Source

pub fn config_filename(&self) -> &'static str

Get the config filename (for Candle/GGUF backends).

Source

pub fn model2vec_repo_id(&self) -> &'static str

Get the HuggingFace repo hosting the Model2Vec distilled vocabulary matrix.

Source

pub fn gguf_repo_id(&self) -> &'static str

Get the HuggingFace repo hosting the GGUF quantized models for this embedding model.

Source

pub fn query_prefix(&self) -> Option<&'static str>

Get the query prefix for models that require one. Some models, such as E5, expect different prefixes on queries and on documents. Returns None for models that need no prefix.

Source

pub fn document_prefix(&self) -> Option<&'static str>

Get the document/passage prefix for models that require it.
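
A caller would typically prepend the returned prefix before tokenizing. The sketch below shows one way to do that; `apply_prefix` is a hypothetical helper, and the literal `"query: "` used in the usage note is the convention from the E5 model card, assumed here rather than confirmed as this crate's return value.

```rust
/// Prepend `prefix` to `text` when the model defines one;
/// otherwise return `text` unchanged.
fn apply_prefix(prefix: Option<&str>, text: &str) -> String {
    match prefix {
        Some(p) => format!("{}{}", p, text),
        None => text.to_owned(),
    }
}
```

For example, `apply_prefix(Some("query: "), "rust enums")` yields `"query: rust enums"`, while a model without a prefix leaves the text as-is.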

Source

pub fn use_mean_pooling(&self) -> bool

Whether this model uses mean pooling (vs CLS token).
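
To make the distinction concrete, here is a minimal sketch of the two pooling strategies over per-token vectors. Both functions are illustrative stand-ins with hypothetical names; the crate's own pooling code may differ (for instance in how it handles padding or batching).

```rust
/// Mean pooling: average the token vectors whose attention-mask entry is 1,
/// so padding tokens do not influence the sentence embedding.
fn mean_pool(token_embeddings: &[Vec<f32>], attention_mask: &[u8]) -> Vec<f32> {
    let dim = token_embeddings.first().map_or(0, |t| t.len());
    let mut sum = vec![0.0f32; dim];
    let mut count = 0usize;
    for (tok, &mask) in token_embeddings.iter().zip(attention_mask) {
        if mask == 1 {
            for (s, v) in sum.iter_mut().zip(tok) {
                *s += *v;
            }
            count += 1;
        }
    }
    if count > 0 {
        for s in &mut sum {
            *s /= count as f32;
        }
    }
    sum
}

/// CLS pooling: use the first token's vector as the sentence embedding.
fn cls_pool(token_embeddings: &[Vec<f32>]) -> Vec<f32> {
    token_embeddings.first().cloned().unwrap_or_default()
}
```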

Source

pub fn normalize_embeddings(&self) -> bool

Whether embeddings should be normalized.
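
Normalization here most likely means scaling each vector to unit L2 length, which makes cosine similarity equal to a plain dot product. The sketch below assumes that interpretation; `l2_normalize` and `dot` are hypothetical helpers.

```rust
/// Scale `v` in place to unit L2 norm. A zero vector is left unchanged.
fn l2_normalize(v: &mut [f32]) {
    let norm = v.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm > 0.0 {
        for x in v.iter_mut() {
            *x /= norm;
        }
    }
}

/// Dot product; for unit-length inputs this equals cosine similarity.
fn dot(a: &[f32], b: &[f32]) -> f32 {
    a.iter().zip(b).map(|(x, y)| x * y).sum()
}
```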

Source

pub fn tokens_per_second_cpu(&self) -> usize

Get approximate tokens per second on CPU (for estimation).
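
One plausible use of this figure is a rough time estimate before a batch job. The sketch below divides a token count by a throughput value; `estimate_cpu_time` is a hypothetical helper, and any throughput number passed to it in examples is made up rather than taken from the crate.

```rust
use std::time::Duration;

/// Estimate how long embedding `total_tokens` would take at `tokens_per_second`.
/// Returns `None` for a zero throughput to avoid dividing by zero.
fn estimate_cpu_time(total_tokens: usize, tokens_per_second: usize) -> Option<Duration> {
    if tokens_per_second == 0 {
        return None;
    }
    let secs = total_tokens as f64 / tokens_per_second as f64;
    Some(Duration::from_secs_f64(secs))
}
```

With the real enum, the second argument would presumably be `model.tokens_per_second_cpu()`.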

Source

pub fn onnx_repo_id(&self) -> &'static str

Get the HuggingFace repository ID hosting the ONNX INT8 model for this embedding model.

These are Xenova-hosted Optimum ONNX exports: INT8-quantized and pre-built, so no conversion is needed. Model sizes: BgeLarge ~130 MB, MiniLM 23 MB, BgeSmall 35 MB, E5Small 35 MB.

Source

pub fn onnx_filename(&self) -> &'static str

Get the ONNX model filename for CPU inference (INT8 quantized).

Source

pub fn onnx_filename_gpu(&self) -> &'static str

Get the ONNX model filename for GPU (CUDA EP) inference.

Returns the FP32 model (onnx/model.onnx) instead of INT8. With the INT8 quantized model, ONNX Runtime (ORT) falls back to the CPU execution provider (EP) for every unsupported INT8 op, which inserts 336 Memcpy CPU↔GPU round-trips and makes CUDA 24× slower than pure CPU inference. The FP32 model contains no unsupported ops and runs entirely on-device.
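
The choice between the two filenames can be sketched as a simple match on the target device. `Device` and `select_onnx_file` are hypothetical names introduced for illustration; in real code the two strings would come from `onnx_filename()` and `onnx_filename_gpu()`.

```rust
/// Hypothetical device selector, not a type from the crate.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum Device {
    Cpu,
    Cuda,
}

/// Pick the ONNX file for a device: the INT8 file on CPU,
/// the FP32 file on CUDA to avoid CPU fallbacks for INT8 ops.
fn select_onnx_file<'a>(device: Device, cpu_int8: &'a str, gpu_fp32: &'a str) -> &'a str {
    match device {
        Device::Cpu => cpu_int8,
        Device::Cuda => gpu_fp32,
    }
}
```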

Source

pub fn all() -> &'static [EmbeddingModel]

List all available models.

Source

pub fn parse(s: &str) -> Option<Self>

Parse a model from a string (case-insensitive). Returns None if the string does not name a known model.

Source

pub fn from_env() -> Self

Get the active model from the DAKERA_MODEL environment variable, defaulting to BgeLarge.
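
The interplay of parsing and the environment default might look like the sketch below. It uses a two-variant stand-in enum (`ModelSketch`, hypothetical). Only the string `modernbert-embed-base` appears in this documentation; the `bge-large` spelling, the whitespace trimming, and the fallback to the default on an unrecognized value are assumptions.

```rust
use std::env;

/// Two-variant stand-in for illustration (hypothetical, not the crate's type).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
enum ModelSketch {
    BgeLarge,
    ModernBertEmbedBase,
}

impl ModelSketch {
    /// Case-insensitive parse. "bge-large" is an assumed spelling.
    fn parse(s: &str) -> Option<Self> {
        match s.trim().to_ascii_lowercase().as_str() {
            "bge-large" => Some(Self::BgeLarge),
            "modernbert-embed-base" => Some(Self::ModernBertEmbedBase),
            _ => None,
        }
    }

    /// Resolve an optional raw value, falling back to BgeLarge when it is
    /// missing or (by assumption) unrecognized.
    fn from_value(raw: Option<&str>) -> Self {
        raw.and_then(Self::parse).unwrap_or(Self::BgeLarge)
    }

    /// Read DAKERA_MODEL from the process environment.
    fn from_env() -> Self {
        Self::from_value(env::var("DAKERA_MODEL").ok().as_deref())
    }
}
```

Splitting the logic into `from_value` keeps the parsing testable without mutating process-wide environment state.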

Trait Implementations§

Source§

impl Clone for EmbeddingModel

Source§

fn clone(&self) -> EmbeddingModel

Returns a duplicate of the value.
1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.
Source§

impl Debug for EmbeddingModel

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.
Source§

impl Default for EmbeddingModel

Source§

fn default() -> EmbeddingModel

Returns the “default value” for a type.
Source§

impl<'de> Deserialize<'de> for EmbeddingModel

Source§

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer.
Source§

impl Display for EmbeddingModel

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.
Source§

impl Hash for EmbeddingModel

Source§

fn hash<__H: Hasher>(&self, state: &mut __H)

Feeds this value into the given Hasher.
1.3.0 · Source§

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

Feeds a slice of this type into the given Hasher.
Source§

impl PartialEq for EmbeddingModel

Source§

fn eq(&self, other: &EmbeddingModel) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 (const: unstable) · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl Serialize for EmbeddingModel

Source§

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>
where __S: Serializer,

Serialize this value into the given Serde serializer.
Source§

impl Copy for EmbeddingModel

Source§

impl Eq for EmbeddingModel

Source§

impl StructuralPartialEq for EmbeddingModel

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self.
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value.
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest.
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper.
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper.
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.
Source§

impl<T> Pointable for T

Source§

const ALIGN: usize

The alignment of pointer.
Source§

type Init = T

The type for initializers.
Source§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer.
Source§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer.
Source§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer.
Source§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer.
Source§

impl<T> PolicyExt for T
where T: ?Sized,

Source§

fn and<P, B, E>(self, other: P) -> And<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow only if self and other return Action::Follow.
Source§

fn or<P, B, E>(self, other: P) -> Or<T, P>
where T: Policy<B, E>, P: Policy<B, E>,

Create a new Policy that returns Action::Follow if either self or other returns Action::Follow.
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.
Source§

impl<T> ToString for T
where T: Display + ?Sized,

Source§

fn to_string(&self) -> String

Converts the given value to a String.
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V

Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper.
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper.
Source§

impl<ST, DT> CastableFrom<ST, Initialized, Initialized> for DT
where ST: ?Sized, DT: ?Sized,

Source§

impl<ST, DT> CastableFrom<ST, Uninit, Uninit> for DT
where ST: ?Sized, DT: ?Sized,

Source§

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,

Source§

impl<T> Read<Exclusive, BecauseExclusive> for T
where T: ?Sized,