Struct CachedIndex

pub struct CachedIndex<I> { /* private fields */ }

A drop-in IndexCore wrapper that memoizes search results.

CachedIndex holds any I: IndexCore and forwards every call to it, with one addition: identical search calls — same query and same SearchParams — are served from an in-memory LRU cache instead of re-running the search. Because it is an IndexCore, it slots in anywhere the wrapped index does, including behind Box<dyn IndexCore>.
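The text above says a cached search is identified by its query and its SearchParams, but not how that identity is computed. As a rough sketch of one way to do it, the block below builds a hashable key from the bit patterns of the query, since `f32` implements neither `Eq` nor `Hash`. `Params` and `Key` are hypothetical stand-ins for illustration, not types from this crate.

```rust
/// Hypothetical stand-in for `SearchParams`; only one field is modelled.
#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
pub struct Params {
    pub k: usize,
}

/// Hypothetical cache key: the query's raw bit patterns plus the parameters.
/// Each `f32` is stored through `to_bits`, so the key can derive `Eq` and `Hash`.
#[derive(Clone, PartialEq, Eq, Hash, Debug)]
pub struct Key {
    query_bits: Vec<u32>,
    params: Params,
}

impl Key {
    /// Builds a key that is equal only for bit-identical queries and equal params.
    pub fn new(query: &[f32], params: &Params) -> Self {
        Key {
            query_bits: query.iter().map(|x| x.to_bits()).collect(),
            params: *params,
        }
    }
}
```

Under this scheme `0.0` and `-0.0` produce different keys, which costs at most an extra miss and never a wrong hit.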

§Correctness

The cache never returns a result made stale by a write through this wrapper. Every mutation that can change the search space (insert, insert_batch, and delete) invalidates the cache, so a search after a write always recomputes against the current index. Operations that do not change the result set (flush and the read-only accessors) leave the cache intact.
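The invalidation rule can be illustrated with a much-reduced model. This is a sketch under stated assumptions, not the crate's implementation: `TinyIndex` and `Memo` are hypothetical, the cache is keyed on `k` alone, and it is cleared wholesale on each write.

```rust
use std::collections::HashMap;
use std::sync::Mutex;

/// Hypothetical stand-in for an index: an ordered list of ids.
pub struct TinyIndex {
    ids: Vec<u64>,
}

impl TinyIndex {
    pub fn new(ids: Vec<u64>) -> Self { TinyIndex { ids } }
    pub fn insert(&mut self, id: u64) { self.ids.push(id); }
    pub fn delete(&mut self, id: u64) { self.ids.retain(|x| *x != id); }
    pub fn flush(&mut self) {}
    pub fn search(&self, k: usize) -> Vec<u64> {
        self.ids.iter().take(k).copied().collect()
    }
}

/// Hypothetical wrapper: memoizes `search`, clears the memo on every write.
pub struct Memo {
    inner: TinyIndex,
    cache: Mutex<HashMap<usize, Vec<u64>>>,
}

impl Memo {
    pub fn new(inner: TinyIndex) -> Self {
        Memo { inner, cache: Mutex::new(HashMap::new()) }
    }

    /// Writes change the search space, so they drop all cached results.
    pub fn insert(&mut self, id: u64) {
        self.inner.insert(id);
        self.cache.get_mut().unwrap().clear();
    }

    pub fn delete(&mut self, id: u64) {
        self.inner.delete(id);
        self.cache.get_mut().unwrap().clear();
    }

    /// `flush` cannot change any result, so the cache is left intact.
    pub fn flush(&mut self) {
        self.inner.flush();
    }

    pub fn cached_len(&self) -> usize {
        self.cache.lock().unwrap().len()
    }

    pub fn search(&self, k: usize) -> Vec<u64> {
        let cached = self.cache.lock().unwrap().get(&k).cloned();
        if let Some(hit) = cached {
            return hit;
        }
        let out = self.inner.search(k);
        self.cache.lock().unwrap().insert(k, out.clone());
        out
    }
}
```

Because the write methods take `&mut self`, no search can be in flight while the cache is cleared, which is what makes whole-cache invalidation exact in this model.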

§Opt-in

Caching is an optimization a caller chooses by wrapping an index; the database leaves indexes unwrapped by default. Construct a cache that holds a fixed number of recent searches with new or with_capacity, or tune it through a CacheConfig with with_config. A capacity of 0 disables caching entirely: every search passes straight through, which is useful for A/B measuring the cache’s effect without changing call sites.
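To make the capacity semantics concrete, here is a minimal LRU sketch in which a capacity of 0 stores nothing, so every lookup misses. The `Lru` type, its linear-scan layout, and the reading of `is_enabled` as `capacity > 0` are assumptions for illustration; the crate's internal structure is not documented here.

```rust
use std::collections::VecDeque;

/// Hypothetical minimal LRU; the front of the deque is the least recently used.
pub struct Lru {
    capacity: usize,
    entries: VecDeque<(u32, String)>,
}

impl Lru {
    pub fn with_capacity(capacity: usize) -> Self {
        Lru { capacity, entries: VecDeque::new() }
    }

    /// Assumed meaning: caching is enabled whenever the capacity is non-zero.
    pub fn is_enabled(&self) -> bool {
        self.capacity > 0
    }

    /// Returns the value and marks the entry as most recently used.
    pub fn get(&mut self, key: u32) -> Option<String> {
        let pos = self.entries.iter().position(|(k, _)| *k == key)?;
        let entry = self.entries.remove(pos)?;
        let value = entry.1.clone();
        self.entries.push_back(entry);
        Some(value)
    }

    /// Stores a value, evicting the least recently used entry when full.
    /// With capacity 0 nothing is stored, so every later `get` misses.
    pub fn put(&mut self, key: u32, value: String) {
        if self.capacity == 0 {
            return;
        }
        if let Some(pos) = self.entries.iter().position(|(k, _)| *k == key) {
            let _ = self.entries.remove(pos);
        } else if self.entries.len() == self.capacity {
            let _ = self.entries.pop_front();
        }
        self.entries.push_back((key, value));
    }
}
```

The linear scan keeps the sketch short; a production LRU would normally pair a hash map with a linked list to get constant-time lookups.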

§Time-to-live

A CacheConfig::ttl gives entries an expiry: a cached result older than the TTL is treated as a miss and recomputed. Mutations through this wrapper already invalidate exactly, so the TTL exists to bound staleness from changes the wrapper cannot see — for example, the wrapped index mutated through another handle. With no TTL (the default) the clock is never consulted.
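The expiry check described above might look like the following sketch. `Entry` and `fresh` are hypothetical names; the clock is passed in as a closure only so the example can show that it is not consulted when no TTL is set.

```rust
use std::time::{Duration, Instant};

/// Hypothetical cache entry: a stored result and the time it was stored.
pub struct Entry {
    pub value: Vec<u64>,
    pub stored_at: Instant,
}

/// Returns the cached value unless it is older than `ttl`.
/// With `ttl == None` the `now` closure is never called.
pub fn fresh(
    entry: &Entry,
    ttl: Option<Duration>,
    now: impl FnOnce() -> Instant,
) -> Option<&[u64]> {
    match ttl {
        None => Some(entry.value.as_slice()),
        Some(ttl) => {
            // Saturates to zero if `now` is somehow earlier than `stored_at`.
            let age = now().saturating_duration_since(entry.stored_at);
            if age > ttl {
                None // expired: the caller treats this as a miss and recomputes
            } else {
                Some(entry.value.as_slice())
            }
        }
    }
}
```

A real cache would pass `Instant::now` as the clock; tests can pass a fixed instant instead.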

§Concurrency

CachedIndex is Send + Sync whenever I is (which every IndexCore is). Reads share the cache behind a Mutex held only for the lookup and the insert — never across the wrapped search — so concurrent misses run the underlying search in parallel rather than serializing on the lock.
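The locking discipline described above, with the lock held for the lookup and for the insert but never across the search, can be sketched as follows. `Cached` is a hypothetical type wrapping an arbitrary function in place of an index; it is an illustration of the pattern, not the crate's code.

```rust
use std::collections::HashMap;
use std::sync::Mutex;

/// Hypothetical memoizing wrapper around a function standing in for `search`.
pub struct Cached<F> {
    search: F,
    cache: Mutex<HashMap<u64, u64>>,
}

impl<F: Fn(u64) -> u64> Cached<F> {
    pub fn new(search: F) -> Self {
        Cached { search, cache: Mutex::new(HashMap::new()) }
    }

    pub fn get(&self, key: u64) -> u64 {
        // Lock 1: lookup only. The guard is a temporary that is dropped
        // at the end of this statement.
        let cached = self.cache.lock().unwrap().get(&key).copied();
        if let Some(v) = cached {
            return v;
        }
        // No lock is held here, so concurrent misses run in parallel.
        let v = (self.search)(key);
        // Lock 2: insert only.
        self.cache.lock().unwrap().insert(key, v);
        v
    }
}
```

The trade-off in this pattern is that two threads missing on the same key at the same time both run the search, and the later insert overwrites the earlier one with an equal value.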

§Examples

use std::sync::Arc;

use iqdb_cache::CachedIndex;
use iqdb_index::{Index, IndexCore, IndexStats};
use iqdb_types::{DistanceMetric, Hit, IqdbError, Metadata, Result, SearchParams, VectorId};

// A minimal index that returns one hit per search; enough to show the wrap.
struct Stub {
    dim: usize,
    metric: DistanceMetric,
    ids: Vec<VectorId>,
}

impl IndexCore for Stub {
    fn insert(&mut self, id: VectorId, _v: Arc<[f32]>, _m: Option<Metadata>) -> Result<()> {
        self.ids.push(id);
        Ok(())
    }
    fn delete(&mut self, id: &VectorId) -> Result<()> {
        match self.ids.iter().position(|x| x == id) {
            Some(pos) => { let _ = self.ids.remove(pos); Ok(()) }
            None => Err(IqdbError::NotFound),
        }
    }
    fn search(&self, _q: &[f32], params: &SearchParams) -> Result<Vec<Hit>> {
        Ok(self.ids.iter().take(params.k).map(|id| Hit::new(id.clone(), 0.0)).collect())
    }
    fn len(&self) -> usize { self.ids.len() }
    fn dim(&self) -> usize { self.dim }
    fn metric(&self) -> DistanceMetric { self.metric }
    fn flush(&mut self) -> Result<()> { Ok(()) }
    fn stats(&self) -> IndexStats {
        IndexStats { n_vectors: self.ids.len(), index_type: "stub", ..IndexStats::default() }
    }
}

let stub = Stub { dim: 3, metric: DistanceMetric::Cosine, ids: vec![VectorId::from(1u64)] };
let cached = CachedIndex::new(stub); // `search` and `cache_stats` take `&self`, so no `mut` is needed

let params = SearchParams::new(1, DistanceMetric::Cosine);
let first = cached.search(&[1.0, 0.0, 0.0], &params)?;  // miss: runs the search
let again = cached.search(&[1.0, 0.0, 0.0], &params)?;  // hit: served from cache
assert_eq!(first, again);

let stats = cached.cache_stats();
assert_eq!(stats.hits, 1);
assert_eq!(stats.misses, 1);

Implementations§

impl<I: IndexCore> CachedIndex<I>

pub fn new(inner: I) -> Self

§Examples

pub fn with_capacity(inner: I, capacity: usize) -> Self

§Examples

pub fn with_config(inner: I, config: CacheConfig) -> Self

§Examples

pub fn capacity(&self) -> usize

pub fn ttl(&self) -> Option<Duration>

pub fn policy(&self) -> EvictionPolicy

pub fn is_enabled(&self) -> bool

pub fn get_ref(&self) -> &I

pub fn into_inner(self) -> I

§Examples

pub fn clear_cache(&mut self)

pub fn cache_stats(&self) -> CacheStats

Trait Implementations§

impl<I: IndexCore> IndexCore for CachedIndex<I>

fn insert( &mut self, id: VectorId, vector: Arc<[f32]>, metadata: Option<Metadata>, ) -> Result<()>

fn insert_batch( &mut self, items: Vec<(VectorId, Arc<[f32]>, Option<Metadata>)>, ) -> Result<()>

fn delete(&mut self, id: &VectorId) -> Result<()>

fn search(&self, query: &[f32], params: &SearchParams) -> Result<Vec<Hit>>

fn len(&self) -> usize

fn is_empty(&self) -> bool

fn dim(&self) -> usize

fn metric(&self) -> DistanceMetric

fn flush(&mut self) -> Result<()>

fn stats(&self) -> IndexStats

fn search_batch( &self, queries: &[&[f32]], params: &SearchParams, ) -> Result<Vec<Vec<Hit>>, IqdbError>

Auto Trait Implementations§

impl<I> !Freeze for CachedIndex<I>

impl<I> !RefUnwindSafe for CachedIndex<I>

impl<I> !UnwindSafe for CachedIndex<I>

impl<I> Send for CachedIndex<I> where I: Send,

impl<I> Sync for CachedIndex<I> where I: Sync,

impl<I> Unpin for CachedIndex<I> where I: Unpin,

impl<I> UnsafeUnpin for CachedIndex<I> where I: UnsafeUnpin,

Blanket Implementations§

impl<T> Any for T where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for T where U: From<T>,

fn into(self) -> U

impl<T, U> TryFrom<U> for T where U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<E> WithErrorCode<E> for E

fn with_code(self, code: impl Into<String>) -> CodedError<E>
