pub struct Bloom<T> { /* private fields */ }
Expand description
Bloom filter.
This is a basic Bloom filter, using SipHasher13 hash funcs. It can be fine-tuned by passing custom parameters when being initialized. The default constructor uses a false positive rate of 1% and 2 hash functions.
Heavily inspired from bloomfilter crate.
§Examples
use ofilter::Bloom;
use std::hash::Hash;
#[derive(Hash)]
struct Obj {
some: usize,
thing: usize,
}
let mut filter: Bloom<Obj> = Bloom::new(100);
let obj = Obj{ some: 1, thing: 2};
assert!(!filter.check(&obj)); // object is not in filter
filter.set(&obj); // put object in filter
assert!(filter.check(&obj)); // object is in filter
Implementations§
source§impl<T> Bloom<T>
impl<T> Bloom<T>
sourcepub fn new(capacity: usize) -> Self
pub fn new(capacity: usize) -> Self
Create a new Bloom filter, with given capacity.
All other parameters are set to defaults, or aligned to match capacity.
This type of filter is useful to maintain a set of max capacity
items.
It can have false positives but no false negatives. This means you
can be sure that an item does NOT belong to the set, but only have
a clue whether the item is in the set or not.
§Examples
use ofilter::Bloom;
let filter: Bloom<usize> = Bloom::new(100);
assert_eq!(100, filter.capacity());
sourcepub fn new_with_params(params: Params) -> Self
pub fn new_with_params(params: Params) -> Self
Create a new Bloom filter, with specific parameters.
§Examples
use ofilter::{Bloom, Params};
let filter: Bloom<usize> = Bloom::new_with_params(Params::with_nb_items_and_fp_rate(100, 0.1));
assert_eq!(100, filter.capacity());
sourcepub fn params(&self) -> Params
pub fn params(&self) -> Params
Get filter params.
This can be useful because when creating the filter, the .adjust()
func is called, and may decide to fine-tune some parameters. With this,
one can know exactly what is used by the filter.
§Examples
use ofilter::Bloom;
let filter: Bloom<usize> = Bloom::new(100);
println!("{}", filter.params());
sourcepub fn capacity(&self) -> usize
pub fn capacity(&self) -> usize
Get filter capacity.
Returns the value of params.nb_items
, that is the number of
items the filter is designed for.
use ofilter::{Bloom, Params};
let filter: Bloom<usize> = Bloom::new_with_params(Params::with_bit_len(1_000_000));
assert_eq!(52681, filter.capacity());
sourcepub fn clear(&mut self)
pub fn clear(&mut self)
Clear the filter.
Clears the bit vector, but keeps parameters.
use ofilter::Bloom;
let mut filter: Bloom<usize> = Bloom::new(1_000);
filter.set(&10);
assert!(filter.check(&10));
filter.clear();
assert!(!filter.check(&10));
sourcepub fn is_empty(&self) -> bool
pub fn is_empty(&self) -> bool
Returns true if filter is empty.
use ofilter::Bloom;
let mut filter: Bloom<usize> = Bloom::new(1_000);
assert!(filter.is_empty());
filter.set(&10);
assert!(!filter.is_empty());
sourcepub fn fp_rate(&self) -> f64
pub fn fp_rate(&self) -> f64
Returns the current false positive rate.
In theory the false positive rate fp_rate
is known at
filter creation. But that, is the theoretical fp_rate
that the filter reaches when it is “wasted” because it
has too many entries. Until then, it performs better
than that, statistically.
This function returns the actual false positive rate.
When this value is greater than the fp_rate
from
the parameters, one should not continue to add items
to the filter.
use ofilter::Bloom;
let mut filter: Bloom<usize> = Bloom::new(1_000);
assert_eq!(0.0, filter.fp_rate());
filter.set(&10);
assert!(filter.fp_rate() > 0.0); // will be params.fp_rate when filter is full
sourcepub fn level(&self) -> f64
pub fn level(&self) -> f64
Returns the ratio between real fp_rate, and theoretical fp_rate.
This is a helper to quickly compare the real fp_rate
, given
how filled the filter is, and the theoretical fp_rate
.
When this value is greater than 1.0, one should not continue to add items to the filter.
use ofilter::Bloom;
let mut filter: Bloom<usize> = Bloom::new(1_000);
assert_eq!(0.0, filter.level());
filter.set(&10);
assert!(filter.level() > 0.0); // will be 1.0 when filter is full
sourcepub fn is_ok(&self) -> bool
pub fn is_ok(&self) -> bool
Returns true if there is still room in the filter.
This is a helper which returns true if the actual
fp_rate
is lower than the theoretical fp_rate
.
When this is false, one should not continue to add items to the filter.
use ofilter::Bloom;
let mut filter: Bloom<usize> = Bloom::new(1_000);
assert_eq!(0.0, filter.level());
filter.set(&10);
assert!(filter.level() > 0.0); // will be 1.0 when filter is full
sourcepub fn count_ones(&self) -> usize
pub fn count_ones(&self) -> usize
Returns the number of ones in the bit vector.
By comparing this number with the size of the bit vector, one can estimate how “filled” the filter is.
use ofilter::Bloom;
let mut filter: Bloom<usize> = Bloom::new(1_000_000);
assert_eq!(0, filter.count_ones());
filter.set(&10);
assert_eq!(2, filter.count_ones());
sourcepub fn export(&self) -> &BitVec ⓘ
pub fn export(&self) -> &BitVec ⓘ
Export the underlying bit vector.
use ofilter::Bloom;
let mut filter: Bloom<usize> = Bloom::new(1_000_000);
let bit_vec = filter.export();
println!("{:?}", bit_vec);
sourcepub fn import(&mut self, src: &BitVec)
pub fn import(&mut self, src: &BitVec)
Import the underlying bit vector.
use ofilter::Bloom;
use bitvec::prelude::BitVec;
let mut filter: Bloom<usize> = Bloom::new(1_000_000);
let mut bit_vec = BitVec::new();
bit_vec.resize(100, false);
bit_vec.set(42, true);
filter.import(&bit_vec); // safe to call even is sizes mismatch
source§impl<T> Bloom<T>where
T: Hash,
impl<T> Bloom<T>where
T: Hash,
sourcepub fn set(&mut self, item: &T)
pub fn set(&mut self, item: &T)
Record an item in the set.
Once this has been called, any call to check()
will return
true, as there are no false negatives. However some other items
may test positive as a consequence of recording this one.
use ofilter::Bloom;
let mut filter: Bloom<usize> = Bloom::new(1_000);
filter.set(&42);
sourcepub fn check(&self, item: &T) -> bool
pub fn check(&self, item: &T) -> bool
Guess whether an item is likely to be in the set.
If set()
has been called before with value, then this returns
true, as there are no false negatives. However it may respond
true even if the item has never been recorded in the set.
use ofilter::Bloom;
let mut filter: Bloom<usize> = Bloom::new(1_000);
filter.set(&42);
assert!(filter.check(&42));
sourcepub fn check_and_set(&mut self, item: &T) -> bool
pub fn check_and_set(&mut self, item: &T) -> bool
Record an item in the set and returns its previous value.
Equivalent to calling get()
then set()
but performs
hash lookup only once so it’s a bit more efficient.
use ofilter::Bloom;
let mut filter: Bloom<usize> = Bloom::new(1_000);
assert!(!filter.check_and_set(&42));
assert!(filter.check(&42));
Trait Implementations§
source§impl<'de, T> Deserialize<'de> for Bloom<T>where
T: Deserialize<'de>,
impl<'de, T> Deserialize<'de> for Bloom<T>where
T: Deserialize<'de>,
source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Auto Trait Implementations§
impl<T> Freeze for Bloom<T>
impl<T> RefUnwindSafe for Bloom<T>where
T: RefUnwindSafe,
impl<T> Send for Bloom<T>where
T: Send,
impl<T> Sync for Bloom<T>where
T: Sync,
impl<T> Unpin for Bloom<T>where
T: Unpin,
impl<T> UnwindSafe for Bloom<T>where
T: UnwindSafe,
Blanket Implementations§
source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
source§impl<T> FmtForward for T
impl<T> FmtForward for T
source§fn fmt_binary(self) -> FmtBinary<Self>where
Self: Binary,
fn fmt_binary(self) -> FmtBinary<Self>where
Self: Binary,
self
to use its Binary
implementation when Debug
-formatted.source§fn fmt_display(self) -> FmtDisplay<Self>where
Self: Display,
fn fmt_display(self) -> FmtDisplay<Self>where
Self: Display,
self
to use its Display
implementation when
Debug
-formatted.source§fn fmt_lower_exp(self) -> FmtLowerExp<Self>where
Self: LowerExp,
fn fmt_lower_exp(self) -> FmtLowerExp<Self>where
Self: LowerExp,
self
to use its LowerExp
implementation when
Debug
-formatted.source§fn fmt_lower_hex(self) -> FmtLowerHex<Self>where
Self: LowerHex,
fn fmt_lower_hex(self) -> FmtLowerHex<Self>where
Self: LowerHex,
self
to use its LowerHex
implementation when
Debug
-formatted.source§fn fmt_octal(self) -> FmtOctal<Self>where
Self: Octal,
fn fmt_octal(self) -> FmtOctal<Self>where
Self: Octal,
self
to use its Octal
implementation when Debug
-formatted.source§fn fmt_pointer(self) -> FmtPointer<Self>where
Self: Pointer,
fn fmt_pointer(self) -> FmtPointer<Self>where
Self: Pointer,
self
to use its Pointer
implementation when
Debug
-formatted.source§fn fmt_upper_exp(self) -> FmtUpperExp<Self>where
Self: UpperExp,
fn fmt_upper_exp(self) -> FmtUpperExp<Self>where
Self: UpperExp,
self
to use its UpperExp
implementation when
Debug
-formatted.source§fn fmt_upper_hex(self) -> FmtUpperHex<Self>where
Self: UpperHex,
fn fmt_upper_hex(self) -> FmtUpperHex<Self>where
Self: UpperHex,
self
to use its UpperHex
implementation when
Debug
-formatted.source§impl<T> Pipe for Twhere
T: ?Sized,
impl<T> Pipe for Twhere
T: ?Sized,
source§fn pipe<R>(self, func: impl FnOnce(Self) -> R) -> Rwhere
Self: Sized,
fn pipe<R>(self, func: impl FnOnce(Self) -> R) -> Rwhere
Self: Sized,
source§fn pipe_ref<'a, R>(&'a self, func: impl FnOnce(&'a Self) -> R) -> Rwhere
R: 'a,
fn pipe_ref<'a, R>(&'a self, func: impl FnOnce(&'a Self) -> R) -> Rwhere
R: 'a,
self
and passes that borrow into the pipe function. Read moresource§fn pipe_ref_mut<'a, R>(&'a mut self, func: impl FnOnce(&'a mut Self) -> R) -> Rwhere
R: 'a,
fn pipe_ref_mut<'a, R>(&'a mut self, func: impl FnOnce(&'a mut Self) -> R) -> Rwhere
R: 'a,
self
and passes that borrow into the pipe function. Read moresource§fn pipe_borrow<'a, B, R>(&'a self, func: impl FnOnce(&'a B) -> R) -> R
fn pipe_borrow<'a, B, R>(&'a self, func: impl FnOnce(&'a B) -> R) -> R
source§fn pipe_borrow_mut<'a, B, R>(
&'a mut self,
func: impl FnOnce(&'a mut B) -> R
) -> R
fn pipe_borrow_mut<'a, B, R>( &'a mut self, func: impl FnOnce(&'a mut B) -> R ) -> R
source§fn pipe_as_ref<'a, U, R>(&'a self, func: impl FnOnce(&'a U) -> R) -> R
fn pipe_as_ref<'a, U, R>(&'a self, func: impl FnOnce(&'a U) -> R) -> R
self
, then passes self.as_ref()
into the pipe function.source§fn pipe_as_mut<'a, U, R>(&'a mut self, func: impl FnOnce(&'a mut U) -> R) -> R
fn pipe_as_mut<'a, U, R>(&'a mut self, func: impl FnOnce(&'a mut U) -> R) -> R
self
, then passes self.as_mut()
into the pipe
function.source§fn pipe_deref<'a, T, R>(&'a self, func: impl FnOnce(&'a T) -> R) -> R
fn pipe_deref<'a, T, R>(&'a self, func: impl FnOnce(&'a T) -> R) -> R
self
, then passes self.deref()
into the pipe function.source§impl<T> Tap for T
impl<T> Tap for T
source§fn tap_borrow<B>(self, func: impl FnOnce(&B)) -> Self
fn tap_borrow<B>(self, func: impl FnOnce(&B)) -> Self
Borrow<B>
of a value. Read moresource§fn tap_borrow_mut<B>(self, func: impl FnOnce(&mut B)) -> Self
fn tap_borrow_mut<B>(self, func: impl FnOnce(&mut B)) -> Self
BorrowMut<B>
of a value. Read moresource§fn tap_ref<R>(self, func: impl FnOnce(&R)) -> Self
fn tap_ref<R>(self, func: impl FnOnce(&R)) -> Self
AsRef<R>
view of a value. Read moresource§fn tap_ref_mut<R>(self, func: impl FnOnce(&mut R)) -> Self
fn tap_ref_mut<R>(self, func: impl FnOnce(&mut R)) -> Self
AsMut<R>
view of a value. Read moresource§fn tap_deref<T>(self, func: impl FnOnce(&T)) -> Self
fn tap_deref<T>(self, func: impl FnOnce(&T)) -> Self
Deref::Target
of a value. Read moresource§fn tap_deref_mut<T>(self, func: impl FnOnce(&mut T)) -> Self
fn tap_deref_mut<T>(self, func: impl FnOnce(&mut T)) -> Self
Deref::Target
of a value. Read moresource§fn tap_dbg(self, func: impl FnOnce(&Self)) -> Self
fn tap_dbg(self, func: impl FnOnce(&Self)) -> Self
.tap()
only in debug builds, and is erased in release builds.source§fn tap_mut_dbg(self, func: impl FnOnce(&mut Self)) -> Self
fn tap_mut_dbg(self, func: impl FnOnce(&mut Self)) -> Self
.tap_mut()
only in debug builds, and is erased in release
builds.source§fn tap_borrow_dbg<B>(self, func: impl FnOnce(&B)) -> Self
fn tap_borrow_dbg<B>(self, func: impl FnOnce(&B)) -> Self
.tap_borrow()
only in debug builds, and is erased in release
builds.source§fn tap_borrow_mut_dbg<B>(self, func: impl FnOnce(&mut B)) -> Self
fn tap_borrow_mut_dbg<B>(self, func: impl FnOnce(&mut B)) -> Self
.tap_borrow_mut()
only in debug builds, and is erased in release
builds.source§fn tap_ref_dbg<R>(self, func: impl FnOnce(&R)) -> Self
fn tap_ref_dbg<R>(self, func: impl FnOnce(&R)) -> Self
.tap_ref()
only in debug builds, and is erased in release
builds.source§fn tap_ref_mut_dbg<R>(self, func: impl FnOnce(&mut R)) -> Self
fn tap_ref_mut_dbg<R>(self, func: impl FnOnce(&mut R)) -> Self
.tap_ref_mut()
only in debug builds, and is erased in release
builds.source§fn tap_deref_dbg<T>(self, func: impl FnOnce(&T)) -> Self
fn tap_deref_dbg<T>(self, func: impl FnOnce(&T)) -> Self
.tap_deref()
only in debug builds, and is erased in release
builds.