pub struct SparseVectorConfig {
pub format: SparseFormat,
pub index_size: IndexSize,
pub weight_quantization: WeightQuantization,
pub weight_threshold: f32,
pub block_size: usize,
pub bmp_block_size: u32,
pub max_bmp_grid_bytes: u64,
pub bmp_superblock_size: u32,
pub pruning: Option<f32>,
pub query_config: Option<SparseQueryConfig>,
pub dims: Option<u32>,
pub max_weight: Option<f32>,
pub min_terms: usize,
}
Configuration for sparse vector storage
Research-validated optimizations for learned sparse retrieval (SPLADE, uniCOIL, etc.):
- Weight threshold (0.01-0.05): Removes ~30-50% of postings with minimal nDCG impact
- Posting list pruning (0.1): Keeps top 10% per dimension, 50-70% index reduction, <1% nDCG loss
- Query pruning (top 10-20 dims): 30-50% latency reduction, <2% nDCG loss
- UInt8 quantization: 4x compression, 1-2% nDCG loss (optimal trade-off)
Fields

format: SparseFormat
Index format: MaxScore (DAAT) or BMP (BAAT).

index_size: IndexSize
Size of dimension/term indices.

weight_quantization: WeightQuantization
Quantization for weights (see WeightQuantization docs for trade-offs).

weight_threshold: f32
Minimum weight threshold: weights below this value are not indexed.
Research recommendation (Guo et al., 2022; SPLADE v2):
- 0.01-0.05 for SPLADE models removes ~30-50% of postings
- Minimal impact on nDCG@10 (<1% loss)
- Major reduction in index size and query latency
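As a standalone sketch, indexing-time thresholding amounts to filtering each sparse vector's (dimension, weight) pairs. The helper below is hypothetical, not part of this crate; the crate applies the threshold internally when building a segment.

```rust
// Hypothetical sketch of indexing-time weight thresholding.
fn apply_weight_threshold(entries: &[(u32, f32)], threshold: f32) -> Vec<(u32, f32)> {
    entries
        .iter()
        .copied()
        // Entries with weight below the threshold are simply not indexed.
        .filter(|&(_dim, weight)| weight >= threshold)
        .collect()
}

fn main() {
    // A SPLADE-style sparse vector as (dimension, weight) pairs.
    let vector = [(17, 0.8), (42, 0.03), (99, 0.004), (512, 1.2)];
    let kept = apply_weight_threshold(&vector, 0.01);
    println!("{kept:?}"); // the 0.004 entry is dropped
}
```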
block_size: usize
Block size for posting lists (must be a power of 2; default 128 for SIMD). Larger blocks give better compression; smaller blocks give faster seeks. Used by the MaxScore format only.
bmp_block_size: u32
BMP block size: number of consecutive doc_ids per block (must be a power of 2). Default 64. Only used when format = Bmp. Smaller blocks give better pruning granularity; larger blocks give less overhead.
max_bmp_grid_bytes: u64
Maximum BMP grid memory in bytes. If the grid (num_dims × num_blocks) would exceed this, bmp_block_size is automatically increased to cap memory. Default: 256 MB. Set to 0 to disable the cap.
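A minimal sketch of how such a cap can be enforced, assuming one byte per grid cell and doubling to preserve the power-of-2 invariant (the crate's exact cell size and growth strategy may differ):

```rust
// Sketch of capping BMP grid memory by growing the block size.
// Assumption: one byte per grid cell (num_dims × num_blocks cells).
fn capped_block_size(num_docs: u64, num_dims: u64, mut block_size: u64, max_grid_bytes: u64) -> u64 {
    if max_grid_bytes == 0 {
        return block_size; // 0 disables the cap
    }
    loop {
        let num_blocks = num_docs.div_ceil(block_size);
        if num_dims * num_blocks <= max_grid_bytes {
            return block_size;
        }
        block_size *= 2; // doubling keeps the power-of-2 invariant
    }
}

fn main() {
    // 1M docs, a 30522-dim SPLADE vocabulary, and a 256 MB cap:
    let bs = capped_block_size(1_000_000, 30522, 64, 256 * 1024 * 1024);
    println!("{bs}"); // block size grows from 64 until the grid fits
}
```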
bmp_superblock_size: u32
BMP superblock size: number of consecutive blocks grouped for hierarchical pruning (Carlson et al., SIGIR 2025). Must be a power of 2. Default 64. Set to 0 to disable superblock pruning (flat BMP scoring). Only used when format = Bmp.
pruning: Option<f32>
Static pruning: fraction of postings to keep per inverted list (SEISMIC-style). Lists are sorted by weight descending and truncated to the top fraction.
Research recommendation (SPLADE v2, Formal et al., 2021):
- None = keep all postings (default, exact)
- Some(0.1) = keep top 10% of postings per dimension
- 50-70% index size reduction
- <1% nDCG@10 loss
- Exploits “concentration of importance” in learned representations
Applied only during initial segment build, not during merge.
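In spirit, the truncation step looks like the following (hypothetical helper, not the crate's internals):

```rust
// Sketch of SEISMIC-style static pruning of one inverted list:
// sort postings by weight descending, keep the top `fraction`.
fn prune_posting_list(mut postings: Vec<(u32, f32)>, fraction: f32) -> Vec<(u32, f32)> {
    postings.sort_by(|a, b| b.1.partial_cmp(&a.1).unwrap());
    // Keep at least one posting so the dimension is never emptied.
    let keep = ((postings.len() as f32 * fraction).ceil() as usize).max(1);
    postings.truncate(keep);
    postings
}

fn main() {
    // Ten postings as (doc_id, weight); fraction 0.1 keeps the single heaviest.
    let list: Vec<(u32, f32)> = (0..10).map(|i| (i, i as f32)).collect();
    println!("{:?}", prune_posting_list(list, 0.1));
}
```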
query_config: Option<SparseQueryConfig>
Query-time configuration (tokenizer, weighting).
dims: Option<u32>
Fixed vocabulary size (number of dimensions) for the BMP format.
When set, all BMP segments use the same grid dimensions (rows = dims), enabling zero-copy block-copy merge. The grid is indexed by dim_id directly (no dim_ids Section C needed).
Required for BMP V12 format. Typical values:
- SPLADE/BERT: 30522 or 105879 (WordPiece / Unigram vocabulary)
- uniCOIL: 30522
- Custom models: set to vocabulary size
If None, BMP builder derives dims from observed data (V10 behavior).
max_weight: Option<f32>
Fixed max weight scale for the BMP format.
When set, all BMP segments use the same quantization scale (max_weight_scale = max_weight), eliminating rescaling during merge.
For SPLADE models: 5.0 (covers the typical weight range 0-5). If None, the BMP builder derives the scale from the data (V10 behavior).
min_terms: usize
Minimum number of postings in a dimension before pruning and weight_threshold filtering are applied. Protects dimensions with very few postings from losing most of their signal.
Default: 4. Set to 0 to always apply pruning/filtering.
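Combined with weight_threshold, the guard behaves roughly like this sketch (hypothetical helper; the crate's internal order of operations may differ):

```rust
// Sketch of the min_terms guard: dimensions with few postings skip filtering.
fn filter_dimension(postings: Vec<(u32, f32)>, threshold: f32, min_terms: usize) -> Vec<(u32, f32)> {
    if postings.len() < min_terms {
        return postings; // too few postings: keep all of the signal
    }
    postings.into_iter().filter(|&(_, w)| w >= threshold).collect()
}

fn main() {
    // Two low-weight postings survive because the dimension is below min_terms.
    let small = vec![(1, 0.001), (2, 0.002)];
    println!("{:?}", filter_dimension(small, 0.01, 4));
}
```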
Implementations
impl SparseVectorConfig

pub fn splade() -> Self
SPLADE-optimized config with research-validated defaults
Optimized for SPLADE, uniCOIL, and similar learned sparse retrieval models. Based on research findings from:
- Pati (2025): UInt8 quantization = 4x compression, 1-2% nDCG loss
- Formal et al. (2021): SPLADE v2 posting list pruning
- Qiao et al. (2023): Query dimension pruning and approximate search
- Guo et al. (2022): Weight thresholding for efficiency
Expected performance vs. full precision baseline:
- Index size: ~15-25% of original (combined effect of all optimizations)
- Query latency: 40-60% faster
- Effectiveness: 2-4% nDCG@10 loss (typically acceptable for production)
Vocabulary: ~30K dimensions (fits in u16)
pub fn splade_bmp() -> Self
SPLADE-optimized config with BMP (Block-Max Pruning) format
Same optimization settings as splade() but uses the BMP block-at-a-time
format (Mallia, Suel & Tonellotto, SIGIR 2024) instead of MaxScore.
BMP divides the document space into fixed-size blocks and processes them
in decreasing upper-bound order, enabling aggressive early termination.
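A toy sketch of the block-at-a-time idea, with made-up inputs (the real index derives per-block upper bounds from a quantized grid and returns top-k, not top-1):

```rust
// Toy BMP-style top-1 search: visit blocks in decreasing upper-bound order
// and stop once no remaining bound can beat the best score found so far.
fn bmp_top1(upper_bounds: &[f32], exact_scores: &[f32]) -> f32 {
    let mut order: Vec<usize> = (0..upper_bounds.len()).collect();
    order.sort_by(|&a, &b| upper_bounds[b].partial_cmp(&upper_bounds[a]).unwrap());

    let mut best = f32::NEG_INFINITY;
    for block in order {
        if upper_bounds[block] <= best {
            break; // early termination: remaining bounds only decrease
        }
        best = best.max(exact_scores[block]); // fully score this block
    }
    best
}

fn main() {
    // Block 1 has the highest bound; after scoring it, block 2's bound (5.0)
    // cannot beat 9.0, so blocks 2 and 0 are skipped.
    println!("{}", bmp_top1(&[3.0, 10.0, 5.0], &[2.5, 9.0, 4.0]));
}
```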
pub fn compact() -> Self
Compact config: Maximum compression (experimental)
Uses aggressive UInt4 quantization for smallest possible index size. Expected trade-offs:
- Index size: ~10-15% of Float32 baseline
- Effectiveness: ~3-5% nDCG@10 loss
Recommended for: Memory-constrained environments, cache-heavy workloads
pub fn full_precision() -> Self
Full precision config: No compression, baseline effectiveness
Use for: Research baselines, when effectiveness is critical
pub fn conservative() -> Self
Conservative config: Mild optimizations, minimal effectiveness loss
Balances compression and effectiveness with conservative defaults. Expected trade-offs:
- Index size: ~40-50% of Float32 baseline
- Query latency: ~20-30% faster
- Effectiveness: <1% nDCG@10 loss
Recommended for: Production deployments prioritizing effectiveness
pub fn with_weight_threshold(self, threshold: f32) -> Self
Set weight threshold (builder pattern)
pub fn with_pruning(self, fraction: f32) -> Self
Set posting list pruning fraction (builder pattern), e.g. 0.1 = keep the top 10% of postings per dimension.
pub fn bytes_per_entry(&self) -> f32
Bytes per entry (index + weight)
pub fn to_byte(&self) -> u8
Serialize config to a single byte.
Layout: bits 7-4 = IndexSize, bit 3 = format (0=MaxScore, 1=BMP), bits 2-0 = WeightQuantization
pub fn from_byte(b: u8) -> Option<Self>
Deserialize config from a single byte.
Note: weight_threshold, block_size, bmp_block_size, and query_config are not serialized in the byte — they come from the schema.
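The documented bit layout can be sketched in isolation (the discriminant values used here are placeholders; the crate's enums define the real ones):

```rust
// Sketch of the to_byte/from_byte layout:
// bits 7-4 = IndexSize, bit 3 = format (0 = MaxScore, 1 = BMP),
// bits 2-0 = WeightQuantization.
fn pack(index_size: u8, is_bmp: bool, quantization: u8) -> u8 {
    debug_assert!(index_size < 16 && quantization < 8);
    (index_size << 4) | ((is_bmp as u8) << 3) | quantization
}

fn unpack(b: u8) -> (u8, bool, u8) {
    (b >> 4, (b >> 3) & 1 == 1, b & 0b111)
}

fn main() {
    let b = pack(2, true, 5);
    println!("{:08b} -> {:?}", b, unpack(b)); // round-trips the three fields
}
```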
pub fn with_block_size(self, size: usize) -> Self
Set block size (builder pattern). Must be a power of 2; recommended: 64, 128, or 256.
pub fn with_query_config(self, config: SparseQueryConfig) -> Self
Set query configuration (builder pattern)
Trait Implementations
impl Clone for SparseVectorConfig
fn clone(&self) -> SparseVectorConfig
fn clone_from(&mut self, source: &Self)

impl Debug for SparseVectorConfig

impl Default for SparseVectorConfig

impl<'de> Deserialize<'de> for SparseVectorConfig
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>
where
    __D: Deserializer<'de>,

impl PartialEq for SparseVectorConfig

impl Serialize for SparseVectorConfig

impl StructuralPartialEq for SparseVectorConfig
Auto Trait Implementations
impl Freeze for SparseVectorConfig
impl RefUnwindSafe for SparseVectorConfig
impl Send for SparseVectorConfig
impl Sync for SparseVectorConfig
impl Unpin for SparseVectorConfig
impl UnsafeUnpin for SparseVectorConfig
impl UnwindSafe for SparseVectorConfig