Struct CuckooFilter

Source

pub struct CuckooFilter<H = DefaultHasher>where
    H: Hasher + Default,
{ /* private fields */ }

Expand description

A highly concurrent lock-free probabilistic data structure for set membership testing.

§What Makes It “Cuckoo”

Named after the cuckoo bird’s behavior of displacing other birds’ eggs, this filter uses cuckoo hashing where each item can be stored in one of two possible locations. When both locations are full, existing items are “evicted” (like cuckoo eggs) and relocated to their alternate position, creating eviction chains.

§Algorithm Overview

Fingerprints: Items are reduced to small fingerprints (4-32 bits) instead of storing full keys, providing excellent space efficiency.
Dual Hashing: Each item has two possible bucket locations computed from its hash. This provides better space efficiency and flexibility when inserting and removing items.
Eviction Chains: When both buckets are full, a random item is evicted from one bucket and moved to its alternate location, potentially triggering a chain of evictions.
Lock-Free Concurrency: All operations use atomic compare-exchange loops instead of traditional locks, with optimistic concurrency control for read operations. The only exception is when inserting with evictions, where a FullyExclusive lock is used to ensure consistency.

§Key Advantages Over Bloom Filters

Deletions supported: Items can be removed without false negatives
Better space efficiency: ~20-30% less memory for same false positive rate
Bounded lookup time: Always at most 2 bucket checks, never more
High concurrency: Lock-free design enables excellent parallel performance

§Concurrency Model

Reads: Optimistic, can proceed concurrently with most operations
Simple writes: Use atomic compare-exchange loops without blocking other operations
WriterExclusive locks: Used for removing items, and for unique insertions
Complex evictions: Use FullyExclusive locks to ensure consistency

§Time Complexity

Lookup: O(1)
Deletion: O(1)
Insertion: Amortized O(1) due to eviction chains, but the number of evictions is bounded

Struct CuckooFilter Copy item path

§What Makes It “Cuckoo”

§Algorithm Overview

§Key Advantages Over Bloom Filters

§Concurrency Model

§Time Complexity

Implementations§

impl<H: Hasher + Default> CuckooFilter<H>

pub fn insert<T: ?Sized + Hash>(&self, item: &T) -> Result<(), Error>

pub fn insert_unique<T: ?Sized + Hash>(&self, item: &T) -> Result<bool, Error>

pub fn count<T: ?Sized + Hash>(&self, item: &T) -> usize

§Notes

pub fn remove<T: ?Sized + Hash>(&self, item: &T) -> bool

pub fn contains<T: ?Sized + Hash>(&self, item: &T) -> bool

pub fn len(&self) -> usize

pub fn is_empty(&self) -> bool

pub fn capacity(&self) -> usize

pub fn to_bytes(&self) -> Vec<u8> ⓘ

pub fn from_bytes(bytes: &[u8]) -> Result<Self, DeserializeError>

pub fn clear(&self)

pub fn lock(&self, kind: LockKind) -> Option<Lock<'_>>

impl CuckooFilter<DefaultHasher>

pub fn builder() -> CuckooFilterBuilder<DefaultHasher>

pub fn new() -> CuckooFilter<DefaultHasher>

pub fn with_capacity(capacity: usize) -> CuckooFilter<DefaultHasher>

Trait Implementations§

impl<H> Debug for CuckooFilter<H>where H: Hasher + Default + Debug,

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl Default for CuckooFilter<DefaultHasher>

fn default() -> Self

impl<'de, H> Deserialize<'de> for CuckooFilter<H>where H: Hasher + Default,

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>where D: Deserializer<'de>,

impl<H: Hasher + Default> Serialize for CuckooFilter<H>

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>where S: Serializer,

Auto Trait Implementations§

impl<H = DefaultHasher> !Freeze for CuckooFilter<H>

impl<H> RefUnwindSafe for CuckooFilter<H>where H: RefUnwindSafe,

impl<H> Send for CuckooFilter<H>where H: Send,

impl<H> Sync for CuckooFilter<H>where H: Sync,

impl<H> Unpin for CuckooFilter<H>where H: Unpin,

impl<H> UnwindSafe for CuckooFilter<H>where H: UnwindSafe,

Blanket Implementations§

impl<T> Any for Twhere T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for Twhere T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for Twhere T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T, U> Into<U> for Twhere U: From<T>,

fn into(self) -> U

impl<T, U> TryFrom<U> for Twhere U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for Twhere U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<V, T> VZip<V> for Twhere V: MultiLane<T>,

fn vzip(self) -> V

impl<T> DeserializeOwned for Twhere T: for<'de> Deserialize<'de>,

Struct CuckooFilter

impl<H> Debug for CuckooFilter<H>
where H: Hasher + Default + Debug,

impl<'de, H> Deserialize<'de> for CuckooFilter<H>
where H: Hasher + Default,

fn deserialize<D>(deserializer: D) -> Result<Self, D::Error>
where D: Deserializer<'de>,

fn serialize<S>(&self, serializer: S) -> Result<S::Ok, S::Error>
where S: Serializer,

impl<H> RefUnwindSafe for CuckooFilter<H>
where H: RefUnwindSafe,

impl<H> Send for CuckooFilter<H>
where H: Send,

impl<H> Sync for CuckooFilter<H>
where H: Sync,

impl<H> Unpin for CuckooFilter<H>
where H: Unpin,

impl<H> UnwindSafe for CuckooFilter<H>
where H: UnwindSafe,

impl<T> Any for T
where T: 'static + ?Sized,

impl<T> Borrow<T> for T
where T: ?Sized,

impl<T> BorrowMut<T> for T
where T: ?Sized,

impl<T, U> Into<U> for T
where U: From<T>,

impl<T, U> TryFrom<U> for T
where U: Into<T>,

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

impl<T> DeserializeOwned for T
where T: for<'de> Deserialize<'de>,