Expand description
Probabilistic data structures for scalable approximate analytics.
The crate currently exposes:
minmax_sketch::MinMaxSketchfor approximate frequency estimation.hyperloglog::HyperLogLogfor approximate cardinality estimation.jacardfor approximate set overlap/Jaccard helpers on HyperLogLog.bloom_filter::BloomFilterfor approximate set membership checks.count_sketch::CountSketchfor signed approximate frequency estimation.space_saving::SpaceSavingfor approximate heavy hitters.kll::KllSketchfor approximate quantiles.tdigest::TDigestfor tail-friendly quantiles.cuckoo_filter::CuckooFilterfor membership with deletions.minhash::MinHashfor approximate Jaccard estimation.lsh_minhash::MinHashLshIndexfor approximate nearest-neighbor lookup.reservoir_sampling::ReservoirSamplingfor uniform stream sampling.
Modules§
- bloom_
filter - Bloom filter for approximate set membership.
- count_
sketch - Count Sketch for approximate frequency estimation.
- cuckoo_
filter - Cuckoo filter for approximate set membership with deletions.
- hyperloglog
- HyperLogLog cardinality estimator.
- jacard
- Jaccard similarity trait shared by sketch implementations.
- kll
- KLL sketch for approximate quantile queries.
- lsh_
minhash - MinHash banding LSH index for approximate nearest-neighbor candidate search.
- minhash
- MinHash sketch for approximate Jaccard similarity.
- minmax_
sketch - MinMax sketch for approximate frequency counting.
- reservoir_
sampling - Reservoir sampling for uniform samples from streaming data.
- space_
saving - Space-Saving sketch for approximate heavy hitters.
- tdigest
- t-digest for approximate quantiles, especially in distribution tails.
Enums§
- Sketch
Error - Errors returned by sketch constructors and merge operations.