pub struct Sq8Codec {
pub dim: usize,
/* private fields */
}Expand description
SQ8 calibration parameters: per-dimension min/max.
Fields§
§dim: usizeImplementations§
Source§impl Sq8Codec
impl Sq8Codec
Sourcepub fn calibrate(vectors: &[&[f32]], dim: usize) -> Self
pub fn calibrate(vectors: &[&[f32]], dim: usize) -> Self
Calibrate min/max from a set of training vectors.
Scans all vectors to find per-dimension min/max bounds. At least 1000 vectors recommended for stable calibration; for fewer vectors the bounds may be tight, causing clipping on future inserts outside the calibration range.
Sourcepub fn quantize_batch(&self, vectors: &[&[f32]]) -> Vec<u8> ⓘ
pub fn quantize_batch(&self, vectors: &[&[f32]]) -> Vec<u8> ⓘ
Batch quantize: quantize all vectors into a contiguous byte array.
Returns dim * N bytes laid out as [v0_d0, v0_d1, ..., v1_d0, ...].
Sourcepub fn dequantize(&self, quantized: &[u8]) -> Vec<f32>
pub fn dequantize(&self, quantized: &[u8]) -> Vec<f32>
Dequantize INT8 back to FP32 (lossy reconstruction).
Sourcepub fn asymmetric_l2(&self, query: &[f32], candidate: &[u8]) -> f32
pub fn asymmetric_l2(&self, query: &[f32], candidate: &[u8]) -> f32
Asymmetric L2 squared distance: query (FP32) vs candidate (INT8).
This is the hot-path function used during HNSW traversal. The query stays in full precision; only the candidate is quantized.
Sourcepub fn asymmetric_cosine(&self, query: &[f32], candidate: &[u8]) -> f32
pub fn asymmetric_cosine(&self, query: &[f32], candidate: &[u8]) -> f32
Asymmetric cosine distance: query (FP32) vs candidate (INT8).
Sourcepub fn asymmetric_ip(&self, query: &[f32], candidate: &[u8]) -> f32
pub fn asymmetric_ip(&self, query: &[f32], candidate: &[u8]) -> f32
Asymmetric negative inner product: query (FP32) vs candidate (INT8).
Sourcepub fn to_bytes(&self) -> Vec<u8> ⓘ
pub fn to_bytes(&self) -> Vec<u8> ⓘ
Serialize the codec to bytes with a versioned magic header.
Format: [NDSQ\0\0 (6 bytes)][version: u8 = 1][msgpack payload]
Sourcepub fn from_bytes(bytes: &[u8]) -> Result<Self, VectorError>
pub fn from_bytes(bytes: &[u8]) -> Result<Self, VectorError>
Deserialize the codec from bytes produced by Self::to_bytes.
Returns VectorError::InvalidMagic if the header does not match
NDSQ\0\0, and VectorError::UnsupportedVersion for unknown versions.
Trait Implementations§
Source§impl<'de> Deserialize<'de> for Sq8Codec
impl<'de> Deserialize<'de> for Sq8Codec
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Source§impl<'__msgpack_de> FromMessagePack<'__msgpack_de> for Sq8Codec
impl<'__msgpack_de> FromMessagePack<'__msgpack_de> for Sq8Codec
Source§impl ToMessagePack for Sq8Codec
impl ToMessagePack for Sq8Codec
Source§impl VectorCodec for Sq8Codec
impl VectorCodec for Sq8Codec
Source§fn encode(&self, v: &[f32]) -> Self::Quantized
fn encode(&self, v: &[f32]) -> Self::Quantized
Encode a single FP32 vector into an SQ8 UnifiedQuantizedVector.
§Panics
The UnifiedQuantizedVector::new call will only fail if the outlier
count mismatches the bitmask, or an outlier dim_index ≥ 64. Neither
condition can arise here: outlier_bitmask is 0 and outliers is
empty. The expect is therefore unreachable in practice.
Source§fn prepare_query(&self, q: &[f32]) -> Self::Query
fn prepare_query(&self, q: &[f32]) -> Self::Query
Prepare the FP32 query for asymmetric distance computations.
For SQ8 the query is used directly without rotation or normalization.
Source§fn adc_lut(&self, _q: &Self::Query) -> Option<AdcLut>
fn adc_lut(&self, _q: &Self::Query) -> Option<AdcLut>
SQ8 has no precomputed ADC table — returns None.
Source§fn fast_symmetric_distance(
&self,
q: &Self::Quantized,
v: &Self::Quantized,
) -> f32
fn fast_symmetric_distance( &self, q: &Self::Quantized, v: &Self::Quantized, ) -> f32
Symmetric L2 squared distance between two SQ8-quantized vectors.
Both codes are dequantized to FP32 via self.mins + self.scales, then
the squared difference is accumulated. This is slower than an exact
INT8-INT8 Hamming estimate but SQ8 has no faster bitwise symmetric form.
Source§fn exact_asymmetric_distance(&self, q: &Self::Query, v: &Self::Quantized) -> f32
fn exact_asymmetric_distance(&self, q: &Self::Query, v: &Self::Quantized) -> f32
Asymmetric L2 squared distance: FP32 query vs INT8 candidate.
Delegates directly to Sq8Codec::asymmetric_l2.
Cosine / IP variants require a separate codec wrapper that normalizes
or negates at encode time.
Source§type Quantized = Sq8Quantized
type Quantized = Sq8Quantized
UnifiedQuantizedVector
reference via AsRef.Auto Trait Implementations§
impl Freeze for Sq8Codec
impl RefUnwindSafe for Sq8Codec
impl Send for Sq8Codec
impl Sync for Sq8Codec
impl Unpin for Sq8Codec
impl UnsafeUnpin for Sq8Codec
impl UnwindSafe for Sq8Codec
Blanket Implementations§
Source§impl<T> ArchivePointee for T
impl<T> ArchivePointee for T
Source§type ArchivedMetadata = ()
type ArchivedMetadata = ()
Source§fn pointer_metadata(
_: &<T as ArchivePointee>::ArchivedMetadata,
) -> <T as Pointee>::Metadata
fn pointer_metadata( _: &<T as ArchivePointee>::ArchivedMetadata, ) -> <T as Pointee>::Metadata
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> Instrument for T
impl<T> Instrument for T
Source§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
Source§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> LayoutRaw for T
impl<T> LayoutRaw for T
Source§fn layout_raw(_: <T as Pointee>::Metadata) -> Result<Layout, LayoutError>
fn layout_raw(_: <T as Pointee>::Metadata) -> Result<Layout, LayoutError>
Source§impl<T, N1, N2> Niching<NichedOption<T, N1>> for N2
impl<T, N1, N2> Niching<NichedOption<T, N1>> for N2
Source§unsafe fn is_niched(niched: *const NichedOption<T, N1>) -> bool
unsafe fn is_niched(niched: *const NichedOption<T, N1>) -> bool
Source§fn resolve_niched(out: Place<NichedOption<T, N1>>)
fn resolve_niched(out: Place<NichedOption<T, N1>>)
out indicating that a T is niched.Source§impl<SS, SP> SupersetOf<SS> for SPwhere
SS: SubsetOf<SP>,
impl<SS, SP> SupersetOf<SS> for SPwhere
SS: SubsetOf<SP>,
Source§fn to_subset(&self) -> Option<SS>
fn to_subset(&self) -> Option<SS>
self from the equivalent element of its
superset. Read moreSource§fn is_in_subset(&self) -> bool
fn is_in_subset(&self) -> bool
self is actually part of its subset T (and can be converted to it).Source§fn to_subset_unchecked(&self) -> SS
fn to_subset_unchecked(&self) -> SS
self.to_subset but without any property checks. Always succeeds.Source§fn from_subset(element: &SS) -> SP
fn from_subset(element: &SS) -> SP
self to the equivalent element of its superset.