lshdedup-core 0.1.1

Pure-Rust core for lshdedup: MinHash + LSH near-duplicate detection.
Documentation

lshdedup-core

Pure-Rust core for lshdedup: MinHash + LSH near-duplicate detection.

use lshdedup_core::{Config, Index};

let mut idx = Index::new(Config::default());
idx.insert("a", "the quick brown fox").unwrap();
idx.insert("b", "the quick brown fox jumps").unwrap();
let hits = idx.near_duplicates("the quick brown fox", 0.5);
assert!(!hits.is_empty());

License

Dual-licensed under MIT or Apache-2.0.