Enum VectorStorage

Source

pub enum VectorStorage {
    FullPrecision {
        vectors: Vec<f32>,
        norms: Vec<f32>,
        count: usize,
        dimensions: usize,
    },
    BinaryQuantized {
        quantized: Vec<Vec<u8>>,
        original: Option<Vec<Vec<f32>>>,
        thresholds: Vec<f32>,
        dimensions: usize,
    },
    RaBitQQuantized {
        quantizer: Option<RaBitQ>,
        params: RaBitQParams,
        quantized_data: Vec<u8>,
        quantized_scales: Vec<f32>,
        code_size: usize,
        original: Vec<f32>,
        original_count: usize,
        dimensions: usize,
    },
    ScalarQuantized {
        params: ScalarParams,
        quantized: Vec<u8>,
        norms: Vec<f32>,
        sums: Vec<i32>,
        training_buffer: Vec<f32>,
        count: usize,
        dimensions: usize,
        trained: bool,
    },
}

Expand description

Vector storage (quantized or full precision)

Variants§

§

FullPrecision

Full precision f32 vectors - FLAT CONTIGUOUS STORAGE

Memory: dimensions * 4 bytes per vector + 4 bytes for norm Example: 1536D = 6148 bytes per vector

Vectors stored in single contiguous array for cache efficiency. Access: vectors[id * dimensions..(id + 1) * dimensions]

Norms (||v||²) are stored separately for L2 decomposition optimization: ||a-b||² = ||a||² + ||b||² - 2⟨a,b⟩ This reduces L2 distance from 3N FLOPs to 2N+3 FLOPs (~7% faster).

Fields

§vectors: Vec<f32>

Flat contiguous vector data (all vectors concatenated)

§norms: Vec<f32>

Pre-computed squared norms (||v||²) for L2 decomposition

§count: usize

Number of vectors stored

§dimensions: usize

Dimensions per vector

§

BinaryQuantized

Binary quantized vectors

Memory: dimensions / 8 bytes per vector (1 bit per dimension) Example: 1536D = 192 bytes per vector (32x compression)

Fields

§quantized: Vec<Vec<u8>>

Quantized vectors (1 bit per dimension, packed into bytes)

§original: Option<Vec<Vec<f32>>>

Original vectors for reranking (optional)

If present: Memory = quantized + original If absent: Faster but lower recall

§thresholds: Vec<f32>

Quantization thresholds (one per dimension)

§dimensions: usize

Vector dimensions

§

RaBitQQuantized

RaBitQ quantized vectors for asymmetric search (CLOUD MOAT)

Memory: dimensions * bits / 8 bytes per vector (4-bit = 8x compression) Example: 1536D @ 4-bit = 768 bytes per vector

Key optimization: During search, query stays full precision while candidates use quantized representation. This gives 2-3x throughput by avoiding decompression while maintaining accuracy.

Reranking with original vectors restores recall to near full-precision.

Fields

§quantizer: Option<RaBitQ>

RaBitQ quantizer (contains params)

§params: RaBitQParams

RaBitQ parameters (for serialization)

§quantized_data: Vec<u8>

Quantized codes - flat contiguous array for cache efficiency Access: quantized_data[id * code_size..(id + 1) * code_size]

§quantized_scales: Vec<f32>

Per-vector rescaling factors - contiguous for cache efficiency Access: quantized_scales[id]

§code_size: usize

Bytes per quantized vector (computed from dimensions and bits) For 4-bit: code_size = dimensions / 2

§original: Vec<f32>

Original vectors for reranking (required for final accuracy) Stored as flat contiguous array for cache efficiency.

§original_count: usize

Number of original vectors stored

§dimensions: usize

Vector dimensions

§

ScalarQuantized

Scalar quantized vectors (SQ8) - 4x compression, ~97% recall, 2-3x faster

Memory: 1x (quantized only, no originals stored) Trade-off: 4x RAM savings for ~3% recall loss

Uses uniform min/max scaling with integer SIMD distance computation. Lazy training: Buffers first 256 vectors, then trains and quantizes.

Note: No rescore support - originals not stored to save memory. Use RaBitQ if you need rescore with originals on disk.

Fields

§params: ScalarParams

Trained quantization parameters (global scale/offset)

§quantized: Vec<u8>

Quantized vectors as flat contiguous u8 array Empty until training completes (after 256 vectors) Access: quantized[id * dimensions..(id + 1) * dimensions]

§norms: Vec<f32>

Pre-computed squared norms of dequantized vectors for L2 decomposition ||dequant(q)||² = Σ(code[d] * scale + offset)² Enables fast distance: ||a-b||² = ||a||² + ||b||² - 2⟨a,b⟩

§sums: Vec<i32>

Pre-computed sums of quantized values for fast integer dot product sum = Σ quantized[d]

§training_buffer: Vec<f32>

Buffer for training vectors (cleared after training) During training phase, stores f32 vectors until we have enough to train

§count: usize

Number of vectors stored

§dimensions: usize

Vector dimensions

§trained: bool

Whether quantization parameters have been trained Training happens automatically after 256 vectors are inserted

Enum VectorStorage Copy item path

Variants§

FullPrecision

Fields

BinaryQuantized

Fields

RaBitQQuantized

Fields

ScalarQuantized

Fields

Implementations§

impl VectorStorage

pub fn new_full_precision(dimensions: usize) -> Self

pub fn new_binary_quantized(dimensions: usize, keep_original: bool) -> Self

pub fn new_rabitq_quantized(dimensions: usize, params: RaBitQParams) -> Self

§Arguments

§Performance

pub fn new_sq8_quantized(dimensions: usize) -> Self

§Arguments

§Performance

§Lazy Training

pub fn is_asymmetric(&self) -> bool

pub fn is_binary_quantized(&self) -> bool

pub fn is_sq8(&self) -> bool

pub fn len(&self) -> usize

pub fn is_empty(&self) -> bool

pub fn dimensions(&self) -> usize

pub fn insert(&mut self, vector: Vec<f32>) -> Result<u32, String>

pub fn get(&self, id: u32) -> Option<&[f32]>

pub fn get_dequantized(&self, id: u32) -> Option<Vec<f32>>

pub fn distance_asymmetric_l2(&self, query: &[f32], id: u32) -> Option<f32>

§Performance (Apple Silicon M3 Max, 768D)

pub fn get_norm(&self, id: u32) -> Option<f32>

pub fn supports_l2_decomposition(&self) -> bool

pub fn distance_l2_decomposed( &self, query: &[f32], query_norm: f32, id: u32, ) -> Option<f32>

pub fn get_quantized(&self, id: u32) -> Option<QuantizedVector>

pub fn quantizer(&self) -> Option<&RaBitQ>

pub fn build_adc_table(&self, query: &[f32]) -> Option<UnifiedADC>

pub fn distance_adc(&self, adc: &UnifiedADC, id: u32) -> Option<f32>

pub fn prefetch(&self, id: u32)

pub fn prefetch_quantized(&self, id: u32)

pub fn rabitq_code_size(&self) -> Option<usize>

pub fn get_rabitq_code(&self, id: u32) -> Option<&[u8]>

pub fn build_interleaved_codes( &self, neighbors: &[u32], output: &mut [u8], ) -> usize

§Arguments

pub fn train_quantization( &mut self, sample_vectors: &[Vec<f32>], ) -> Result<(), String>

pub fn memory_usage(&self) -> usize

pub fn reorder(&mut self, old_to_new: &[u32])

Trait Implementations§

impl Clone for VectorStorage

fn clone(&self) -> VectorStorage

fn clone_from(&mut self, source: &Self)

impl Debug for VectorStorage

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl<'de> Deserialize<'de> for VectorStorage

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where __D: Deserializer<'de>,

impl Serialize for VectorStorage

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>where __S: Serializer,

Auto Trait Implementations§

impl Freeze for VectorStorage

impl RefUnwindSafe for VectorStorage

impl Send for VectorStorage

impl Sync for VectorStorage

impl Unpin for VectorStorage

impl UnwindSafe for VectorStorage

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> CloneToUninit for Twhere T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

impl<T> Downcast for Twhere T: Any,

fn into_any(self: Box<T>) -> Box<dyn Any>

fn into_any_rc(self: Rc<T>) -> Rc<dyn Any>

fn as_any(&self) -> &(dyn Any + 'static)

fn as_any_mut(&mut self) -> &mut (dyn Any + 'static)

impl<T> DowncastSend for Twhere T: Any + Send,

Enum VectorStorage

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T> CloneToUninit for T
where T: Clone,

impl<T> Downcast for T
where T: Any,

impl<T> DowncastSend for T
where T: Any + Send,

impl<T> DowncastSync for T
where T: Any + Send + Sync,

impl<T, U> Into<U> for T
where U: From<T>,

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

impl<T> ToOwned for T
where T: Clone,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,

impl<T> Fruit for T
where T: Send + Downcast,