lshdedup-core 0.1.1

Pure-Rust core for lshdedup: MinHash + LSH near-duplicate detection.
# lshdedup-core

Pure-Rust core for [lshdedup](https://github.com/MukundaKatta/lshdedup):
MinHash + LSH near-duplicate detection.

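```rust
use lshdedup_core::{Config, Index};

// Build an index with the default configuration.
let mut idx = Index::new(Config::default());

// Insert two documents, each keyed by an id.
idx.insert("a", "the quick brown fox").unwrap();
idx.insert("b", "the quick brown fox jumps").unwrap();

// Query with the text of document "a"; at least one hit is expected.
let hits = idx.near_duplicates("the quick brown fox", 0.5);
assert!(!hits.is_empty());
```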
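
For readers new to the technique, the following is a minimal, std-only sketch of how MinHash signatures and LSH banding fit together in general. It is not this crate's implementation: the function names (`shingles`, `minhash`, `estimate_jaccard`, `band_keys`), the word-level shingling, the use of `DefaultHasher`, and the parameters (64 hashes, bands of 4 rows) are all assumptions chosen for illustration.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashSet;
use std::hash::{Hash, Hasher};

/// Split text into a set of shingles of `k` consecutive words (illustrative choice).
fn shingles(text: &str, k: usize) -> HashSet<String> {
    let words: Vec<&str> = text.split_whitespace().collect();
    if words.len() < k {
        // Inputs shorter than `k` words become a single shingle.
        return std::iter::once(words.join(" ")).collect();
    }
    words.windows(k).map(|w| w.join(" ")).collect()
}

/// Hash a shingle under a seed; each seed stands in for one hash function.
fn seeded_hash(seed: u64, shingle: &str) -> u64 {
    let mut h = DefaultHasher::new();
    seed.hash(&mut h);
    shingle.hash(&mut h);
    h.finish()
}

/// MinHash signature: for each seed, the minimum hash over all shingles.
fn minhash(set: &HashSet<String>, num_hashes: usize) -> Vec<u64> {
    (0..num_hashes as u64)
        .map(|seed| {
            set.iter()
                .map(|s| seeded_hash(seed, s))
                .min()
                .unwrap_or(u64::MAX)
        })
        .collect()
}

/// The fraction of matching signature slots estimates Jaccard similarity.
fn estimate_jaccard(a: &[u64], b: &[u64]) -> f64 {
    let same = a.iter().zip(b).filter(|(x, y)| x == y).count();
    same as f64 / a.len() as f64
}

/// LSH banding: hash each band of `rows` signature values to one bucket key.
fn band_keys(sig: &[u64], rows: usize) -> Vec<u64> {
    sig.chunks(rows)
        .map(|band| {
            let mut h = DefaultHasher::new();
            band.hash(&mut h);
            h.finish()
        })
        .collect()
}

fn main() {
    let a = minhash(&shingles("the quick brown fox", 2), 64);
    let b = minhash(&shingles("the quick brown fox jumps", 2), 64);
    println!("estimated Jaccard: {:.2}", estimate_jaccard(&a, &b));

    // Two documents are a candidate pair if they share a bucket in any band.
    let keys_a = band_keys(&a, 4);
    let keys_b = band_keys(&b, 4);
    let candidate = keys_a.iter().zip(&keys_b).any(|(x, y)| x == y);
    println!("LSH candidate pair: {}", candidate);
}
```

In this sketch, more rows per band make a band match less likely for dissimilar documents, while more bands give similar documents more chances to collide; a real index would store the band keys in hash maps so that lookups avoid comparing every pair.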

## License

Dual-licensed under MIT or Apache-2.0.