growable-bloom-filter 0.2.1

Scalable Bloom Filters with serde support
Documentation

#+AUTHOR: David Briggs #+STARTUP: SHOWALL

  • Growable Bloom Filters

[[https://crates.io/crates/growable-bloom-filter][CRATES.IO]] | [[https://docs.rs/growable-bloom-filter/latest/growable_bloom_filter/struct.GrowableBloom.html][DOCUMENTATION]]

** Overview

Implementation of [[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.62.7953&rep=rep1&type=pdf][Scalable Bloom Filters]] which also provides serde serialization and deserialize.

A bloom filter lets you =insert= items, and then test association with =contains=. It's space and time efficient, at the cost of false positives. In particular, if =contains= returns =true=, it may be in filter. But if =contains= returns false, it's definitely not in the bloom filter.

You can control the failure rate by setting =desired_error_prob= and =est_insertions= appropriately.

#+begin_src rust use serde_json; use growable_bloom_filter::GrowableBloom;

let mut gbloom = GrowableBloom::new(0.05, 1000); gbloom.insert(&0); assert!(gbloom.contains(&0));

let s = serde_json::to_string(&gbloom).unwrap(); let des_gbloom: GrowableBloom = serde_json::from_str(&s).unwrap(); assert!(des_gbloom.contains(&0)); #+end_src

** Applications

Bloom filters are typically used as a pre-cache to avoid expensive operations. For example, if you need to ask ten thousand servers if they have data XYZ, you could use GrowableBloom to figure out which ones do NOT have XYZ.