pub struct EmbeddingQuantizer {
    pub bits: u8,
}
Batch quantizer supporting 4-bit and 8-bit precision.
Fields
bits: u8
Quantization bit width (4 or 8).
Implementations
impl EmbeddingQuantizer
pub fn new(bits: u8) -> Self
Create a new quantizer with the specified bit width.
bits should be 4 or 8; other values are accepted but treated as 8.
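The fallback rule above can be sketched in isolation. This is a standalone illustration, not the crate's source: `SketchQuantizer` is a hypothetical stand-in for `EmbeddingQuantizer`, and normalizing `bits` inside the constructor is an assumption. The real implementation may store the value as given and apply the fallback only when quantizing.

```rust
/// Hypothetical stand-in for the documented struct (not the crate's source).
#[derive(Clone, Debug, PartialEq)]
pub struct SketchQuantizer {
    /// Quantization bit width (4 or 8).
    pub bits: u8,
}

impl SketchQuantizer {
    /// Assumption: an unsupported width is replaced by 8 at construction time.
    pub fn new(bits: u8) -> Self {
        let bits = if bits == 4 { 4 } else { 8 };
        Self { bits }
    }
}
```

Under this assumption, a caller who passes an unsupported width such as 3 silently gets an 8-bit quantizer rather than an error, so it may be worth validating the width before calling `new`.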
pub fn quantize_batch(&self, embeddings: &[Vec<f32>]) -> Vec<QuantizedEmbedding>
Quantize a batch of embeddings.
pub fn dequantize_batch(
    &self,
    quantized: &[QuantizedEmbedding],
) -> Vec<Vec<f32>>
Dequantize a batch of quantized embeddings.
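The page does not say which quantization scheme is used or how `QuantizedEmbedding` is laid out. As a rough sketch of what a quantize/dequantize round trip could look like, the following assumes per-embedding affine (min/max) scalar quantization. `SketchQuantized`, its fields, and the free functions are all hypothetical names, and codes are stored one per byte for clarity (a real 4-bit layout would likely pack two codes per byte).

```rust
/// Hypothetical quantized form: one code per value plus the affine parameters.
#[derive(Clone, Debug)]
pub struct SketchQuantized {
    pub codes: Vec<u8>, // unpacked: one code per input value
    pub min: f32,       // value represented by code 0
    pub scale: f32,     // step between adjacent codes
}

/// Largest code for a bit width; anything other than 4 is treated as 8.
fn max_code(bits: u8) -> f32 {
    if bits == 4 { 15.0 } else { 255.0 }
}

/// Map each value to the nearest of 2^bits evenly spaced levels in [min, max].
pub fn quantize_one(bits: u8, v: &[f32]) -> SketchQuantized {
    let top = max_code(bits);
    let lo = v.iter().copied().fold(f32::INFINITY, f32::min);
    let hi = v.iter().copied().fold(f32::NEG_INFINITY, f32::max);
    let range = hi - lo;
    // Constant or empty input: fall back to scale 1.0 to avoid dividing by zero.
    let scale = if range > 0.0 { range / top } else { 1.0 };
    let min = if v.is_empty() { 0.0 } else { lo };
    let codes = v
        .iter()
        .map(|&x| ((x - min) / scale).round().clamp(0.0, top) as u8)
        .collect();
    SketchQuantized { codes, min, scale }
}

/// Reconstruct approximate values from the codes and affine parameters.
pub fn dequantize_one(q: &SketchQuantized) -> Vec<f32> {
    q.codes.iter().map(|&c| q.min + c as f32 * q.scale).collect()
}

pub fn quantize_batch(bits: u8, embeddings: &[Vec<f32>]) -> Vec<SketchQuantized> {
    embeddings.iter().map(|e| quantize_one(bits, e)).collect()
}

pub fn dequantize_batch(quantized: &[SketchQuantized]) -> Vec<Vec<f32>> {
    quantized.iter().map(dequantize_one).collect()
}
```

In this scheme the round trip is lossy: each reconstructed value differs from the original by at most about half a quantization step (`scale / 2`), so 4-bit codes give a noticeably coarser reconstruction than 8-bit codes.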
pub fn compression_ratio(&self, original: &[Vec<f32>]) -> f64
Compute compression ratio: original_bytes / quantized_bytes.
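The arithmetic behind the ratio can be worked through with a small sketch. It assumes each `f32` occupies 4 bytes, that quantized values are bit-packed, and that each embedding may carry some fixed metadata (for example a scale and offset). `sketch_compression_ratio` and its `overhead_per_embedding` parameter are hypothetical; the crate's actual byte accounting is not documented here and may differ.

```rust
/// Hypothetical estimate of original_bytes / quantized_bytes.
/// `overhead_per_embedding` is an assumed per-embedding metadata size in bytes.
pub fn sketch_compression_ratio(
    bits: u8,
    original: &[Vec<f32>],
    overhead_per_embedding: usize,
) -> f64 {
    // Mirror the documented fallback: anything other than 4 is treated as 8.
    let bits: usize = if bits == 4 { 4 } else { 8 };
    let original_bytes: usize = original
        .iter()
        .map(|e| e.len() * std::mem::size_of::<f32>())
        .sum();
    // Packed payload, rounded up to whole bytes, plus metadata.
    let quantized_bytes: usize = original
        .iter()
        .map(|e| (e.len() * bits + 7) / 8 + overhead_per_embedding)
        .sum();
    if quantized_bytes == 0 {
        return 0.0; // assumption: report 0.0 for an empty batch
    }
    original_bytes as f64 / quantized_bytes as f64
}
```

With no metadata, two 128-dimensional embeddings take 1024 bytes as `f32`, 256 bytes at 8 bits (ratio 4.0), and 128 bytes at 4 bits (ratio 8.0). Any per-embedding metadata pulls the ratio below these ideal values, most visibly for short vectors.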
Trait Implementations
impl Clone for EmbeddingQuantizer
fn clone(&self) -> EmbeddingQuantizer
Returns a duplicate of the value.
1.0.0 · fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
Auto Trait Implementations
impl Freeze for EmbeddingQuantizer
impl RefUnwindSafe for EmbeddingQuantizer
impl Send for EmbeddingQuantizer
impl Sync for EmbeddingQuantizer
impl Unpin for EmbeddingQuantizer
impl UnsafeUnpin for EmbeddingQuantizer
impl UnwindSafe for EmbeddingQuantizer
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.
impl<T> Pointable for T
impl<T> PolicyExt for T
where
    T: ?Sized,
impl<SS, SP> SupersetOf<SS> for SP
where
    SS: SubsetOf<SP>,
fn to_subset(&self) -> Option<SS>
The inverse inclusion map: attempts to construct self from the equivalent element of its superset.
fn is_in_subset(&self) -> bool
Checks if self is actually part of its subset T (and can be converted to it).
fn to_subset_unchecked(&self) -> SS
Use with care! Same as self.to_subset but without any property checks. Always succeeds.
fn from_subset(element: &SS) -> SP
The inclusion map: converts self to the equivalent element of its superset.