Skip to main contentModule clones
Source - bucket_key
- Compute the coarse bucket key for a structural fingerprint.
- cluster_matches
- Cluster a slice of
CloneMatch records into connected components using
Union-Find, then return a sorted Vec<CloneCluster> (largest first). - compare_pair
- Compare two source code snippets and return a
CloneMatch if they are
similar enough, or None otherwise. - compute_fingerprints
- Compute structural fingerprints for all symbols that pass the
min_lines
filter in config. - group_into_buckets
- Group a slice of fingerprints by their bucket key.
- jaccard_similarity
- Jaccard similarity on token multisets.
- normalize_identifiers
- Replace identifiers with positional placeholders.
- tokenize
- Split source code into tokens.