lshdedup-core
Pure-Rust core for lshdedup: MinHash + LSH near-duplicate detection.
use lshdedup_core::{Config, Index};
let mut idx = Index::new(Config::default());
idx.insert("a", "the quick brown fox").unwrap();
idx.insert("b", "the quick brown fox jumps").unwrap();
let hits = idx.near_duplicates("the quick brown fox", 0.5);
assert!(!hits.is_empty());
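The call above uses a threshold of 0.5. As a rough illustration of what such a threshold is compared against, here is a minimal std-only sketch of the MinHash half of the technique. It is not this crate's implementation: the whitespace tokenizer, the seeded use of `DefaultHasher`, and the helper names `seeded_hash`, `signature`, `estimate_similarity`, and `jaccard` are all assumptions made for the example, and the LSH banding step is left out.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashSet;
use std::hash::{Hash, Hasher};

/// Hash a token under a seed; each seed stands in for one hash function.
/// (Hypothetical helper, not part of lshdedup_core.)
fn seeded_hash(seed: u64, token: &str) -> u64 {
    let mut h = DefaultHasher::new();
    seed.hash(&mut h);
    token.hash(&mut h);
    h.finish()
}

/// MinHash signature: for each seed, the minimum hash over the token set.
/// Tokens are whitespace-separated words, an assumption for this sketch.
fn signature(text: &str, num_hashes: usize) -> Vec<u64> {
    let tokens: HashSet<&str> = text.split_whitespace().collect();
    (0..num_hashes as u64)
        .map(|seed| {
            tokens
                .iter()
                .map(|t| seeded_hash(seed, t))
                .min()
                .unwrap_or(u64::MAX) // empty input: sentinel value
        })
        .collect()
}

/// Fraction of signature slots that agree, which estimates Jaccard similarity.
/// Assumes both signatures were built with the same number of hashes.
fn estimate_similarity(a: &[u64], b: &[u64]) -> f64 {
    if a.is_empty() {
        return 0.0;
    }
    let same = a.iter().zip(b).filter(|(x, y)| x == y).count();
    same as f64 / a.len() as f64
}

/// Exact Jaccard similarity on whitespace tokens, for comparison.
fn jaccard(a: &str, b: &str) -> f64 {
    let sa: HashSet<&str> = a.split_whitespace().collect();
    let sb: HashSet<&str> = b.split_whitespace().collect();
    let union = sa.union(&sb).count();
    if union == 0 {
        return 0.0;
    }
    sa.intersection(&sb).count() as f64 / union as f64
}

fn main() {
    let a = "the quick brown fox";
    let b = "the quick brown fox jumps";
    let est = estimate_similarity(&signature(a, 256), &signature(b, 256));
    // The exact value is 4 shared words out of 5 distinct words.
    println!("exact = {:.2}, minhash estimate = {:.2}", jaccard(a, b), est);
}
```

With word tokens, the two strings from the example share 4 of 5 distinct words, so their exact Jaccard similarity is 0.8, and the MinHash estimate approaches that value as the number of hashes grows. How the crate itself tokenizes text and interprets the threshold may differ.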
License
Dual-licensed under MIT or Apache-2.0.