pub struct SparseVec {
pub pos: Vec<usize>,
pub neg: Vec<usize>,
}
Sparse ternary vector with positive and negative indices
Fields

pos: Vec<usize> - Indices with +1 value
neg: Vec<usize> - Indices with -1 value
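A minimal sketch of building a vector directly from its public fields; the index values chosen here are arbitrary and assumed to be valid dimensions:

use embeddenator_vsa::SparseVec;
// +1 at indices 3 and 17, -1 at index 42, 0 everywhere else
let vec = SparseVec { pos: vec![3, 17], neg: vec![42] };
assert_eq!(vec.pos.len() + vec.neg.len(), 3);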
Implementations

impl SparseVec

pub fn new() -> Self
Create an empty sparse vector
Examples
use embeddenator_vsa::SparseVec;
let vec = SparseVec::new();
assert!(vec.pos.is_empty());
assert!(vec.neg.is_empty());

pub fn random() -> Self
Generate a random sparse vector with ~1% density
Examples
use embeddenator_vsa::SparseVec;
let vec = SparseVec::random();
// Vector should have approximately 1% density (100 positive + 100 negative)
assert!(vec.pos.len() > 0);
assert!(vec.neg.len() > 0);

pub fn encode_data(
    data: &[u8],
    config: &ReversibleVSAConfig,
    path: Option<&str>,
) -> Self
Encode data into a reversible sparse vector using block-based mapping
This method implements hierarchical encoding with path-based permutations for lossless data recovery. The encoding process:
- Splits data into blocks of configurable size
- Applies path-based permutations to each block
- Combines blocks using hierarchical bundling
Arguments

data - The data to encode
config - Configuration for encoding parameters
path - Optional path string for hierarchical encoding (affects permutation)
Returns
A SparseVec that can be decoded back to the original data
Examples
use embeddenator_vsa::{SparseVec, ReversibleVSAConfig};
let data = b"hello world";
let config = ReversibleVSAConfig::default();
let encoded = SparseVec::encode_data(data, &config, None);
// encoded vector contains reversible representation of the data
assert!(!encoded.pos.is_empty() || !encoded.neg.is_empty());

pub fn decode_data(
    &self,
    config: &ReversibleVSAConfig,
    path: Option<&str>,
    expected_size: usize,
) -> Vec<u8>
Decode data from a reversible sparse vector
Reverses the encoding process to recover the original data. Requires the same configuration and path used during encoding.
Arguments

config - Same configuration used for encoding
path - Same path string used for encoding
expected_size - Expected size of the decoded data (for validation)
Returns
The original data bytes (may need correction layer for 100% fidelity)
Examples
use embeddenator_vsa::{SparseVec, ReversibleVSAConfig};
let data = b"hello world";
let config = ReversibleVSAConfig::default();
let encoded = SparseVec::encode_data(data, &config, None);
let decoded = encoded.decode_data(&config, None, data.len());
// Note: For 100% fidelity, use CorrectionStore with EmbrFS
// Raw decode may have minor differences that corrections compensate for

pub fn from_data(data: &[u8]) -> Self

Deprecated since 0.2.0: Use encode_data() for reversible encoding

Generate a deterministic sparse vector from data using a SHA256 seed. Deprecated: use encode_data() for new code.
Examples
use embeddenator_vsa::{SparseVec, ReversibleVSAConfig};
let data = b"hello world";
let config = ReversibleVSAConfig::default();
let vec1 = SparseVec::encode_data(data, &config, None);
let vec2 = SparseVec::encode_data(data, &config, None);
// Same input produces same vector (deterministic)
assert_eq!(vec1.pos, vec2.pos);
assert_eq!(vec1.neg, vec2.neg);

pub fn bundle_with_config(
    &self,
    other: &SparseVec,
    config: Option<&ReversibleVSAConfig>,
) -> SparseVec
Bundle operation: pairwise conflict-cancel superposition (A ⊕ B)
This is a fast, commutative merge for two vectors:
- same sign => keep
- opposite signs => cancel to 0
- sign vs 0 => keep sign
Note: While this is well-defined for two vectors, repeated application across 3+ vectors is generally not associative because early cancellation/thresholding can discard multiplicity information.
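A hand-constructed sketch of the conflict-cancel rules above, assuming the chosen indices are valid dimensions and that passing None for config skips thinning:

use embeddenator_vsa::SparseVec;
// a: +1 at 1 and 2, -1 at 3;  b: +1 at 3, -1 at 4
let a = SparseVec { pos: vec![1, 2], neg: vec![3] };
let b = SparseVec { pos: vec![3], neg: vec![4] };
let c = a.bundle_with_config(&b, None);
// Unopposed signs are kept; the +1/-1 conflict at index 3 cancels to 0
assert!(c.pos.contains(&1) && c.pos.contains(&2));
assert!(c.neg.contains(&4));
assert!(!c.pos.contains(&3) && !c.neg.contains(&3));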
Arguments

other - The vector to bundle with self
config - Optional ReversibleVSAConfig for controlling sparsity via thinning
Examples
use embeddenator_vsa::{SparseVec, ReversibleVSAConfig};
let config = ReversibleVSAConfig::default();
let vec1 = SparseVec::encode_data(b"data1", &config, None);
let vec2 = SparseVec::encode_data(b"data2", &config, None);
let bundled = vec1.bundle_with_config(&vec2, Some(&config));
// Bundled vector contains superposition of both inputs
// Should be similar to both original vectors
let sim1 = vec1.cosine(&bundled);
let sim2 = vec2.cosine(&bundled);
assert!(sim1 > 0.3);
assert!(sim2 > 0.3);Sourcepub fn bundle(&self, other: &SparseVec) -> SparseVec
pub fn bundle(&self, other: &SparseVec) -> SparseVec
Bundle operation: pairwise conflict-cancel superposition (A ⊕ B)
Semantics are identical to bundle_with_config(); use that variant to optionally apply thinning via a ReversibleVSAConfig.
Examples
use embeddenator_vsa::SparseVec;
let config = embeddenator_vsa::ReversibleVSAConfig::default();
let vec1 = SparseVec::encode_data(b"data1", &config, None);
let vec2 = SparseVec::encode_data(b"data2", &config, None);
let bundled = vec1.bundle(&vec2);
// Bundled vector contains superposition of both inputs
// Should be similar to both original vectors
let sim1 = vec1.cosine(&bundled);
let sim2 = vec2.cosine(&bundled);
assert!(sim1 > 0.3);
assert!(sim2 > 0.3);

pub fn bundle_sum_many<'a, I>(vectors: I) -> SparseVec
where
    I: IntoIterator<Item = &'a SparseVec>,
Associative bundle over many vectors: sums contributions per index, then thresholds to sign. This is order-independent because all contributions are accumulated before applying sign. Complexity: O(K log K) where K is total non-zero entries across inputs.
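A minimal sketch of the order-independence claim, assuming the accumulated result is deterministic for a fixed set of inputs (index order is normalized by sorting before comparison):

use embeddenator_vsa::{SparseVec, ReversibleVSAConfig};
let config = ReversibleVSAConfig::default();
let a = SparseVec::encode_data(b"a", &config, None);
let b = SparseVec::encode_data(b"b", &config, None);
let c = SparseVec::encode_data(b"c", &config, None);
// Contributions are summed before thresholding, so input order should not matter
let x = SparseVec::bundle_sum_many([&a, &b, &c]);
let y = SparseVec::bundle_sum_many([&c, &a, &b]);
let (mut xp, mut yp) = (x.pos.clone(), y.pos.clone());
xp.sort_unstable();
yp.sort_unstable();
assert_eq!(xp, yp);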
pub fn bundle_hybrid_many<'a, I>(vectors: I) -> SparseVec
where
    I: IntoIterator<Item = &'a SparseVec>,
Hybrid bundle: chooses a fast pairwise fold in very sparse regimes (to preserve sparsity), otherwise uses the associative sum-then-threshold path (order-independent, more faithful to a majority vote).
Heuristic: estimate the expected overlap/collision count assuming uniform hashing into DIM. If the expected number of colliding dimensions is below a small budget, use the pairwise bundle; otherwise use bundle_sum_many.
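A minimal sketch, assuming that whichever path the heuristic selects, the bundled result still resembles each input:

use embeddenator_vsa::{SparseVec, ReversibleVSAConfig};
let config = ReversibleVSAConfig::default();
let a = SparseVec::encode_data(b"first", &config, None);
let b = SparseVec::encode_data(b"second", &config, None);
let bundled = SparseVec::bundle_hybrid_many([&a, &b]);
// Either bundling path keeps the result similar to its inputs
assert!(a.cosine(&bundled) > 0.0);
assert!(b.cosine(&bundled) > 0.0);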
pub fn bind(&self, other: &SparseVec) -> SparseVec
Bind operation: non-commutative composition (A ⊙ B). Performs element-wise multiplication. Self-inverse: A ⊙ A ≈ I.
Examples
use embeddenator_vsa::SparseVec;
let config = embeddenator_vsa::ReversibleVSAConfig::default();
let vec = SparseVec::encode_data(b"test", &config, None);
let bound = vec.bind(&vec);
// Bind with self should produce high similarity (self-inverse property)
let identity = SparseVec::encode_data(b"identity", &config, None);
let sim = bound.cosine(&identity);
// Result is approximately identity, so similarity varies
assert!(sim >= -1.0 && sim <= 1.0);

pub fn cosine(&self, other: &SparseVec) -> f64
Calculate cosine similarity between two sparse vectors. Returns a value in [-1, 1], where 1 is identical and 0 is orthogonal.
When the simd feature is enabled, this will automatically use
AVX2 (x86_64) or NEON (aarch64) acceleration if available.
Examples
use embeddenator_vsa::SparseVec;
let config = embeddenator_vsa::ReversibleVSAConfig::default();
let vec1 = SparseVec::encode_data(b"cat", &config, None);
let vec2 = SparseVec::encode_data(b"cat", &config, None);
let vec3 = SparseVec::encode_data(b"dog", &config, None);
// Identical data produces identical vectors
assert!((vec1.cosine(&vec2) - 1.0).abs() < 0.01);
// Different data produces low similarity
let sim = vec1.cosine(&vec3);
assert!(sim < 0.3);

pub fn cosine_scalar(&self, other: &SparseVec) -> f64
Scalar (non-SIMD) cosine similarity implementation.
This is the original implementation and serves as the baseline for SIMD optimizations. It’s also used when SIMD is not available.
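A minimal sketch, assuming the scalar path and the (possibly SIMD-accelerated) cosine() compute the same quantity up to floating-point rounding:

use embeddenator_vsa::{SparseVec, ReversibleVSAConfig};
let config = ReversibleVSAConfig::default();
let a = SparseVec::encode_data(b"alpha", &config, None);
let b = SparseVec::encode_data(b"beta", &config, None);
// cosine() may dispatch to SIMD; cosine_scalar() is the reference implementation
let diff = (a.cosine(&b) - a.cosine_scalar(&b)).abs();
assert!(diff < 1e-6);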
pub fn permute(&self, shift: usize) -> SparseVec
Apply cyclic permutation to vector indices. Used for encoding sequence order in hierarchical structures.
Arguments

shift - Number of positions to shift indices cyclically
Examples
use embeddenator_vsa::SparseVec;
let config = embeddenator_vsa::ReversibleVSAConfig::default();
let vec = SparseVec::encode_data(b"test", &config, None);
let permuted = vec.permute(100);
// Permuted vector should have different indices but same structure
assert_eq!(vec.pos.len(), permuted.pos.len());
assert_eq!(vec.neg.len(), permuted.neg.len());

pub fn inverse_permute(&self, shift: usize) -> SparseVec
Apply inverse cyclic permutation to vector indices. Decodes sequence order by reversing the permutation shift.
Arguments

shift - Number of positions to reverse-shift indices cyclically
Examples
use embeddenator_vsa::SparseVec;
let config = embeddenator_vsa::ReversibleVSAConfig::default();
let vec = SparseVec::encode_data(b"test", &config, None);
let permuted = vec.permute(100);
let recovered = permuted.inverse_permute(100);
// Round-trip should recover original vector
assert_eq!(vec.pos, recovered.pos);
assert_eq!(vec.neg, recovered.neg);

pub fn thin(&self, target_non_zero: usize) -> SparseVec
Context-Dependent Thinning Algorithm
Thinning controls vector sparsity during bundle operations to prevent exponential density growth that degrades VSA performance. The algorithm:
- Calculate current density = (pos.len() + neg.len()) as f32 / DIM as f32
- If current_density <= target_density, return unchanged
- Otherwise, randomly sample indices to reduce to target count
- Preserve pos/neg ratio to maintain signal polarity balance
- Use deterministic seeding for reproducible results
Edge Cases:
- Empty vector: return unchanged
- target_non_zero = 0: return empty vector (not recommended)
- target_non_zero >= current: return clone
- Single polarity vectors: preserve polarity distribution
Performance: O(n log n) due to sorting, where n = target_non_zero
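A minimal sketch of the edge cases described above, assuming a target at or above the current count returns a clone and a smaller target reduces the non-zero count:

use embeddenator_vsa::{SparseVec, ReversibleVSAConfig};
let config = ReversibleVSAConfig::default();
let vec = SparseVec::encode_data(b"some data to thin", &config, None);
let total = vec.pos.len() + vec.neg.len();
// target_non_zero >= current count: returned unchanged
let same = vec.thin(total);
assert_eq!(same.pos.len() + same.neg.len(), total);
// Smaller target: the number of non-zero entries shrinks toward the target
let thinned = vec.thin(total / 2);
assert!(thinned.pos.len() + thinned.neg.len() < total);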