Function find_potential_duplicates 

pub fn find_potential_duplicates(
    similarity_matrix: &[SimilarityPair],
    duplicate_threshold: f32,
) -> Vec<(String, String, f32)>

Finds potential duplicate skills by reusing the existing similarity matrix data instead of recomputing similarities.

Returns the pairs whose similarity is >= duplicate_threshold, sorted by similarity in descending order. A HashSet tracks the pairs already reported, so a pair that appears in both orders (A-B and B-A) is returned only once.
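
The behavior described above can be sketched as follows. This is a minimal illustration, not the crate's actual source. The fields of SimilarityPair are not shown on this page, so the struct below, with its skill_a, skill_b, and similarity fields, is a hypothetical stand-in, and the function name carries a _sketch suffix to keep it distinct from the real one. Normalizing each pair to a sorted key before inserting it into the HashSet is one plausible way to treat A-B and B-A as the same pair; the real implementation may do this differently.

```rust
use std::collections::HashSet;

// Hypothetical stand-in: the real `SimilarityPair` fields are not documented here.
#[derive(Debug, Clone)]
pub struct SimilarityPair {
    pub skill_a: String,
    pub skill_b: String,
    pub similarity: f32,
}

// Sketch of the documented behavior: filter by threshold, deduplicate
// order-insensitively, then sort by similarity descending.
pub fn find_potential_duplicates_sketch(
    similarity_matrix: &[SimilarityPair],
    duplicate_threshold: f32,
) -> Vec<(String, String, f32)> {
    let mut seen: HashSet<(String, String)> = HashSet::new();
    let mut duplicates: Vec<(String, String, f32)> = Vec::new();

    for pair in similarity_matrix {
        // Keep only pairs at or above the threshold (NaN never passes this test).
        if pair.similarity >= duplicate_threshold {
            // Order the two names so that A-B and B-A produce the same key.
            let key = if pair.skill_a <= pair.skill_b {
                (pair.skill_a.clone(), pair.skill_b.clone())
            } else {
                (pair.skill_b.clone(), pair.skill_a.clone())
            };
            // `insert` returns false if the key was already present.
            if seen.insert(key) {
                duplicates.push((
                    pair.skill_a.clone(),
                    pair.skill_b.clone(),
                    pair.similarity,
                ));
            }
        }
    }

    // Highest similarity first; `total_cmp` gives a total order over f32.
    duplicates.sort_by(|a, b| b.2.total_cmp(&a.2));
    duplicates
}
```

In this sketch the first occurrence of a pair wins, so the returned tuple keeps whichever name order appeared first in the input slice.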