pub struct EmbeddingModelWeights { /* private fields */ }
Weights for the Qwen3 embedding model. Same as quantized_qwen3::ModelWeights but without lm_head; returns the last-token hidden state for use as an embedding.
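As a rough illustration of what "last-token hidden state" means, the sketch below picks the final row of a (seq_len, hidden_size) matrix and L2-normalizes it. It uses plain `Vec<f32>` rather than candle tensors, the function name `last_token_embedding` is hypothetical, and the normalization step is an assumption about typical caller-side post-processing, not something this struct is documented to do.

```rust
/// Hypothetical helper: last-token pooling over per-token hidden states.
/// `hidden` holds one row per token, each row of length hidden_size.
/// Returns None for an empty sequence.
fn last_token_embedding(hidden: &[Vec<f32>]) -> Option<Vec<f32>> {
    // The embedding is the hidden state of the final token only.
    let last = hidden.last()?;

    // Assumed post-processing: scale to unit L2 norm so that a dot product
    // between two embeddings equals their cosine similarity.
    let norm = last.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm == 0.0 {
        // Avoid dividing by zero; return the all-zero row unchanged.
        return Some(last.clone());
    }
    Some(last.iter().map(|x| x / norm).collect())
}
```

With a causal model, only the final token has attended to the whole input, which is why the last row is the one used as the sequence embedding.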
Implementations§
impl EmbeddingModelWeights
pub fn from_gguf<R: Read + Seek>(
    ct: Content,
    reader: &mut R,
    device: &CandleDevice,
) -> Result<Self>
Loads the weights from a GGUF file. The file format is the same as for the generative Qwen3 models.
pub fn forward(&mut self, input: &Tensor, offset: usize) -> Result<Tensor>
Forward pass returning the last-token hidden state (1024-dimensional for the 0.6B model).
Uses a causal attention mask, the same as the generative model, because Qwen3-Embedding is architecturally Qwen3ForCausalLM fine-tuned for embedding. Returns a tensor of shape (batch, hidden_size). Only batch_size = 1 is supported, because the causal mask is not broadcast for larger batches.
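To make the causal mask and the `offset` argument concrete, here is a minimal sketch in plain Rust. It assumes that `offset` counts tokens already held in the KV cache, so that new query token `i` sits at absolute position `offset + i` and may attend to key positions `0..=offset + i`. The function name `causal_mask` and the boolean representation are illustrative only and are not taken from this crate.

```rust
/// Hypothetical helper: build a causal attention mask as booleans,
/// where `true` means "query row may attend to key column".
/// `seq_len` new tokens follow `offset` cached tokens, so each row has
/// `offset + seq_len` key positions.
fn causal_mask(seq_len: usize, offset: usize) -> Vec<Vec<bool>> {
    (0..seq_len)
        .map(|i| {
            // Query i has absolute position offset + i and sees every key
            // at or before that position, never a later one.
            (0..offset + seq_len).map(|j| j <= offset + i).collect()
        })
        .collect()
}
```

For `seq_len = 3` and `offset = 0` this yields a lower-triangular pattern: the first token sees only itself, and the last token sees all three. That last row is the one whose hidden state becomes the embedding.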
pub fn clear_kv_cache(&mut self)
Trait Implementations§
impl Clone for EmbeddingModelWeights
fn clone(&self) -> EmbeddingModelWeights
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
Auto Trait Implementations§
impl Freeze for EmbeddingModelWeights
impl !RefUnwindSafe for EmbeddingModelWeights
impl Send for EmbeddingModelWeights
impl Sync for EmbeddingModelWeights
impl Unpin for EmbeddingModelWeights
impl UnsafeUnpin for EmbeddingModelWeights
impl !UnwindSafe for EmbeddingModelWeights
Blanket Implementations§
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.