Struct SparseVectorConfig

Source

pub struct SparseVectorConfig {
    pub index_size: IndexSize,
    pub weight_quantization: WeightQuantization,
    pub weight_threshold: f32,
    pub block_size: usize,
    pub posting_list_pruning: Option<f32>,
    pub query_config: Option<SparseQueryConfig>,
}

Expand description

Configuration for sparse vector storage

Research-validated optimizations for learned sparse retrieval (SPLADE, uniCOIL, etc.):

Weight threshold (0.01-0.05): Removes ~30-50% of postings with minimal nDCG impact
Posting list pruning (0.1): Keeps top 10% per dimension, 50-70% index reduction, <1% nDCG loss
Query pruning (top 10-20 dims): 30-50% latency reduction, <2% nDCG loss
UInt8 quantization: 4x compression, 1-2% nDCG loss (optimal trade-off)

Fields§

§index_size: IndexSize

Size of dimension/term indices

§weight_quantization: WeightQuantization

Quantization for weights (see WeightQuantization docs for trade-offs)

§weight_threshold: f32

Minimum weight threshold - weights below this value are not indexed

Research recommendation (Guo et al., 2022; SPLADE v2):

0.01-0.05 for SPLADE models removes ~30-50% of postings
Minimal impact on nDCG@10 (<1% loss)
Major reduction in index size and query latency

§block_size: usize

Block size for posting lists (must be power of 2, default 128 for SIMD) Larger blocks = better compression, smaller blocks = faster seeks

§posting_list_pruning: Option<f32>

Static pruning: fraction of postings to keep per inverted list (SEISMIC-style) Lists are sorted by weight descending and truncated to top fraction.

Research recommendation (SPLADE v2, Formal et al., 2021):

None = keep all postings (default, exact)
Some(0.1) = keep top 10% of postings per dimension
- 50-70% index size reduction
- <1% nDCG@10 loss
- Exploits “concentration of importance” in learned representations

Applied only during initial segment build, not during merge.

§query_config: Option<SparseQueryConfig>

Query-time configuration (tokenizer, weighting)

Implementations§

Source §

impl SparseVectorConfig

Source

pub fn splade() -> Self

SPLADE-optimized config with research-validated defaults

Optimized for SPLADE, uniCOIL, and similar learned sparse retrieval models. Based on research findings from:

Pati (2025): UInt8 quantization = 4x compression, 1-2% nDCG loss
Formal et al. (2021): SPLADE v2 posting list pruning
Qiao et al. (2023): Query dimension pruning and approximate search
Guo et al. (2022): Weight thresholding for efficiency

Expected performance vs. full precision baseline:

Index size: ~15-25% of original (combined effect of all optimizations)
Query latency: 40-60% faster
Effectiveness: 2-4% nDCG@10 loss (typically acceptable for production)

Vocabulary: ~30K dimensions (fits in u16)

Source

pub fn compact() -> Self

Compact config: Maximum compression (experimental)

Uses aggressive UInt4 quantization for smallest possible index size. Expected trade-offs:

Index size: ~10-15% of Float32 baseline
Effectiveness: ~3-5% nDCG@10 loss

Recommended for: Memory-constrained environments, cache-heavy workloads

Source

pub fn full_precision() -> Self

Full precision config: No compression, baseline effectiveness

Use for: Research baselines, when effectiveness is critical

Source

pub fn conservative() -> Self

Conservative config: Mild optimizations, minimal effectiveness loss

Balances compression and effectiveness with conservative defaults. Expected trade-offs:

Index size: ~40-50% of Float32 baseline
Query latency: ~20-30% faster
Effectiveness: <1% nDCG@10 loss

Recommended for: Production deployments prioritizing effectiveness

Source

pub fn with_weight_threshold(self, threshold: f32) -> Self

Set weight threshold (builder pattern)

Source

pub fn with_pruning(self, fraction: f32) -> Self

Set posting list pruning fraction (builder pattern) e.g., 0.1 = keep top 10% of postings per dimension

Source

pub fn bytes_per_entry(&self) -> f32

Bytes per entry (index + weight)

Source

pub fn to_byte(&self) -> u8

Serialize config to a single byte

Source

pub fn from_byte(b: u8) -> Option<Self>

Deserialize config from a single byte Note: weight_threshold, block_size and query_config are not serialized in the byte

Source

pub fn with_block_size(self, size: usize) -> Self

Set block size (builder pattern) Must be power of 2, recommended: 64, 128, 256

Source

pub fn with_query_config(self, config: SparseQueryConfig) -> Self

Set query configuration (builder pattern)

Trait Implementations§

Source §

impl Clone for SparseVectorConfig

Source §

fn clone(&self) -> SparseVectorConfig

Returns a duplicate of the value. Read more

1.0.0 · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more

Source §

impl Debug for SparseVectorConfig

Source §

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more

Source §

impl Default for SparseVectorConfig

Source §

fn default() -> Self

Returns the “default value” for a type. Read more

Source §

impl<'de> Deserialize<'de> for SparseVectorConfig

Source §

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where __D: Deserializer<'de>,

Deserialize this value from the given Serde deserializer. Read more

Source §

impl PartialEq for SparseVectorConfig

Source §

fn eq(&self, other: &SparseVectorConfig) -> bool

Tests for self and other values to be equal, and is used by ==.

1.0.0 · Source§

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

Source §

impl Serialize for SparseVectorConfig

Source §

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

Serialize this value into the given Serde serializer. Read more

Source §

impl StructuralPartialEq for SparseVectorConfig

Auto Trait Implementations§

§

impl UnwindSafe for SparseVectorConfig

Blanket Implementations§

Source §

impl<T> Any for T
where T: 'static + ?Sized,

Source §

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more

Source §

impl<T> Borrow<T> for T
where T: ?Sized,

Source §

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more

Source §

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source §

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more

Source §

impl<T> CloneToUninit for T
where T: Clone,

Source §

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest. Read more

Source §

impl<T> From<T> for T

Source §

fn from(t: T) -> T

Returns the argument unchanged.

Source §

impl<T, U> Into for T
where U: From<T>,

Source §

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source §

impl<T> IntoEither for T

Source §

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more

Source §

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more

Source §