Skip to main content

SparseVectorConfig

Struct SparseVectorConfig 

Source
pub struct SparseVectorConfig {
Show 13 fields pub format: SparseFormat, pub index_size: IndexSize, pub weight_quantization: WeightQuantization, pub weight_threshold: f32, pub block_size: usize, pub bmp_block_size: u32, pub max_bmp_grid_bytes: u64, pub bmp_superblock_size: u32, pub pruning: Option<f32>, pub query_config: Option<SparseQueryConfig>, pub dims: Option<u32>, pub max_weight: Option<f32>, pub min_terms: usize,
}
Expand description

Configuration for sparse vector storage

Research-validated optimizations for learned sparse retrieval (SPLADE, uniCOIL, etc.):

  • Weight threshold (0.01-0.05): Removes ~30-50% of postings with minimal nDCG impact
  • Posting list pruning (0.1): Keeps top 10% per dimension, 50-70% index reduction, <1% nDCG loss
  • Query pruning (top 10-20 dims): 30-50% latency reduction, <2% nDCG loss
  • UInt8 quantization: 4x compression, 1-2% nDCG loss (optimal trade-off)

Fields§

§format: SparseFormat

Index format: MaxScore (DAAT) or BMP (BAAT)

§index_size: IndexSize

Size of dimension/term indices

§weight_quantization: WeightQuantization

Quantization for weights (see WeightQuantization docs for trade-offs)

§weight_threshold: f32

Minimum weight threshold - weights below this value are not indexed

Research recommendation (Guo et al., 2022; SPLADE v2):

  • 0.01-0.05 for SPLADE models removes ~30-50% of postings
  • Minimal impact on nDCG@10 (<1% loss)
  • Major reduction in index size and query latency
§block_size: usize

Block size for posting lists (must be power of 2, default 128 for SIMD) Larger blocks = better compression, smaller blocks = faster seeks. Used by MaxScore format only.

§bmp_block_size: u32

BMP block size: number of consecutive doc_ids per block (must be power of 2). Default 64. Only used when format = Bmp. Smaller = better pruning granularity, larger = less overhead.

§max_bmp_grid_bytes: u64

Maximum BMP grid memory in bytes. If the grid (num_dims × num_blocks) would exceed this, bmp_block_size is automatically increased to cap memory. Default: 256MB. Set to 0 to disable the cap.

§bmp_superblock_size: u32

BMP superblock size: number of consecutive blocks grouped for hierarchical pruning (Carlson et al., SIGIR 2025). Must be power of 2. Default 64. Set to 0 to disable superblock pruning (flat BMP scoring). Only used when format = Bmp.

§pruning: Option<f32>

Static pruning: fraction of postings to keep per inverted list (SEISMIC-style) Lists are sorted by weight descending and truncated to top fraction.

Research recommendation (SPLADE v2, Formal et al., 2021):

  • None = keep all postings (default, exact)
  • Some(0.1) = keep top 10% of postings per dimension
    • 50-70% index size reduction
    • <1% nDCG@10 loss
    • Exploits “concentration of importance” in learned representations

Applied only during initial segment build, not during merge.

§query_config: Option<SparseQueryConfig>

Query-time configuration (tokenizer, weighting)

§dims: Option<u32>

Fixed vocabulary size (number of dimensions) for BMP format.

When set, all BMP segments use the same grid dimensions (rows = dims), enabling zero-copy block-copy merge. The grid is indexed by dim_id directly (no dim_ids Section C needed).

Required for BMP V12 format. Typical values:

  • SPLADE/BERT: 30522 or 105879 (WordPiece / Unigram vocabulary)
  • uniCOIL: 30522
  • Custom models: set to vocabulary size

If None, BMP builder derives dims from observed data (V10 behavior).

§max_weight: Option<f32>

Fixed max weight scale for BMP format.

When set, all BMP segments use the same quantization scale (max_weight_scale = max_weight), eliminating rescaling during merge.

For SPLADE models: 5.0 (covers typical weight range 0-5). If None, BMP builder derives scale from data (V10 behavior).

§min_terms: usize

Minimum number of postings in a dimension before pruning and weight_threshold filtering are applied. Protects dimensions with very few postings from losing most of their signal.

Default: 4. Set to 0 to always apply pruning/filtering.

Implementations§

Source§

impl SparseVectorConfig

Source

pub fn splade() -> Self

SPLADE-optimized config with research-validated defaults

Optimized for SPLADE, uniCOIL, and similar learned sparse retrieval models. Based on research findings from:

  • Pati (2025): UInt8 quantization = 4x compression, 1-2% nDCG loss
  • Formal et al. (2021): SPLADE v2 posting list pruning
  • Qiao et al. (2023): Query dimension pruning and approximate search
  • Guo et al. (2022): Weight thresholding for efficiency

Expected performance vs. full precision baseline:

  • Index size: ~15-25% of original (combined effect of all optimizations)
  • Query latency: 40-60% faster
  • Effectiveness: 2-4% nDCG@10 loss (typically acceptable for production)

Vocabulary: ~30K dimensions (fits in u16)

Source

pub fn splade_bmp() -> Self

SPLADE-optimized config with BMP (Block-Max Pruning) format

Same optimization settings as splade() but uses the BMP block-at-a-time format (Mallia, Suel & Tonellotto, SIGIR 2024) instead of MaxScore. BMP divides the document space into fixed-size blocks and processes them in decreasing upper-bound order, enabling aggressive early termination.

Source

pub fn compact() -> Self

Compact config: Maximum compression (experimental)

Uses aggressive UInt4 quantization for smallest possible index size. Expected trade-offs:

  • Index size: ~10-15% of Float32 baseline
  • Effectiveness: ~3-5% nDCG@10 loss

Recommended for: Memory-constrained environments, cache-heavy workloads

Source

pub fn full_precision() -> Self

Full precision config: No compression, baseline effectiveness

Use for: Research baselines, when effectiveness is critical

Source

pub fn conservative() -> Self

Conservative config: Mild optimizations, minimal effectiveness loss

Balances compression and effectiveness with conservative defaults. Expected trade-offs:

  • Index size: ~40-50% of Float32 baseline
  • Query latency: ~20-30% faster
  • Effectiveness: <1% nDCG@10 loss

Recommended for: Production deployments prioritizing effectiveness

Source

pub fn with_weight_threshold(self, threshold: f32) -> Self

Set weight threshold (builder pattern)

Source

pub fn with_pruning(self, fraction: f32) -> Self

Set posting list pruning fraction (builder pattern) e.g., 0.1 = keep top 10% of postings per dimension

Source

pub fn bytes_per_entry(&self) -> f32

Bytes per entry (index + weight)

Source

pub fn to_byte(&self) -> u8

Serialize config to a single byte.

Layout: bits 7-4 = IndexSize, bit 3 = format (0=MaxScore, 1=BMP), bits 2-0 = WeightQuantization

Source

pub fn from_byte(b: u8) -> Option<Self>

Deserialize config from a single byte.

Note: weight_threshold, block_size, bmp_block_size, and query_config are not serialized in the byte — they come from the schema.

Source

pub fn with_block_size(self, size: usize) -> Self

Set block size (builder pattern) Must be power of 2, recommended: 64, 128, 256

Source

pub fn with_query_config(self, config: SparseQueryConfig) -> Self

Set query configuration (builder pattern)

Trait Implementations§

Source§

impl Clone for SparseVectorConfig

Source§

fn clone(&self) -> SparseVectorConfig

Returns a duplicate of the value. Read more
1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for SparseVectorConfig

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl Default for SparseVectorConfig

Source§

fn default() -> Self

Returns the “default value” for a type. Read more
Source§

impl<'de> Deserialize<'de> for SparseVectorConfig

Source§

fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer. Read more
Source§

impl PartialEq for SparseVectorConfig

Source§

fn eq(&self, other: &SparseVectorConfig) -> bool

Tests for self and other values to be equal, and is used by ==.
1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
Source§

impl Serialize for SparseVectorConfig

Source§

fn serialize<__S>(&self, __serializer: __S) -> Result<__S::Ok, __S::Error>
where __S: Serializer,

Serialize this value into the given Serde serializer. Read more
Source§

impl StructuralPartialEq for SparseVectorConfig

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

impl<T> Pointable for T

Source§

const ALIGN: usize

The alignment of pointer.
Source§

type Init = T

The type for initializers.
Source§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
Source§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
Source§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
Source§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<SS, SP> SupersetOf<SS> for SP
where SS: SubsetOf<SP>,

Source§

fn to_subset(&self) -> Option<SS>

The inverse inclusion map: attempts to construct self from the equivalent element of its superset. Read more
Source§

fn is_in_subset(&self) -> bool

Checks if self is actually part of its subset T (and can be converted to it).
Source§

fn to_subset_unchecked(&self) -> SS

Use with care! Same as self.to_subset but without any property checks. Always succeeds.
Source§

fn from_subset(element: &SS) -> SP

The inclusion map: converts self to the equivalent element of its superset.
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V

Source§

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,

Source§

impl<T> Scalar for T
where T: 'static + Clone + PartialEq + Debug,