Function cluster_canonicals 

pub fn cluster_canonicals(
    canonicals: &[String],
    threshold: f64,
) -> Vec<(Vec<usize>, f64)>

cluster_canonicals(canonicals, threshold) -> [(member indices, min pairwise ratio)].

Exact single-linkage clustering: two strings are linked when their ratio >= threshold, and each cluster is a connected component of those links. Only clusters of size >= 2 are returned, each paired with its exact minimum intra-cluster pairwise ratio. The output is bit-identical to the reference pairwise clustering, but computed far faster (suffix automaton + early-exit + rayon).
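
For orientation, here is a minimal sketch of what a naive pairwise single-linkage clustering with the same output shape could look like. It is not this crate's implementation: it uses none of the suffix automaton, early-exit, or rayon machinery, and it runs in O(n^2) ratio calls. The page does not define the ratio, so the sketch takes it as a closure. The names `cluster_reference`, `find`, and `toy_ratio` are hypothetical and exist only for illustration.

```rust
use std::collections::BTreeMap;

/// Union-find root lookup with path halving.
fn find(parent: &mut [usize], mut x: usize) -> usize {
    while parent[x] != x {
        parent[x] = parent[parent[x]];
        x = parent[x];
    }
    x
}

/// Hypothetical naive clustering: link every pair whose ratio meets the
/// threshold, then report each connected component of size >= 2 together
/// with the minimum ratio over all pairs inside it.
/// ASSUMPTION: `ratio` stands in for the crate's (undocumented here) metric.
pub fn cluster_reference<F>(
    items: &[String],
    threshold: f64,
    ratio: F,
) -> Vec<(Vec<usize>, f64)>
where
    F: Fn(&str, &str) -> f64,
{
    let n = items.len();
    let mut parent: Vec<usize> = (0..n).collect();

    // Single linkage: one qualifying pair is enough to merge two clusters.
    for i in 0..n {
        for j in (i + 1)..n {
            if ratio(items[i].as_str(), items[j].as_str()) >= threshold {
                let (ri, rj) = (find(&mut parent, i), find(&mut parent, j));
                if ri != rj {
                    parent[rj] = ri;
                }
            }
        }
    }

    // Group indices by root; members end up in ascending index order.
    let mut groups: BTreeMap<usize, Vec<usize>> = BTreeMap::new();
    for i in 0..n {
        let root = find(&mut parent, i);
        groups.entry(root).or_default().push(i);
    }

    groups
        .into_values()
        .filter(|members| members.len() >= 2)
        .map(|members| {
            // Minimum over ALL intra-cluster pairs, not only the linked ones.
            let mut min_ratio = f64::INFINITY;
            for (a, &i) in members.iter().enumerate() {
                for &j in &members[a + 1..] {
                    let r = ratio(items[i].as_str(), items[j].as_str());
                    min_ratio = min_ratio.min(r);
                }
            }
            (members, min_ratio)
        })
        .collect()
}

/// Hypothetical toy metric for demonstration only: the fraction of
/// positions holding the same character, relative to the longer string.
pub fn toy_ratio(a: &str, b: &str) -> f64 {
    let longest = a.chars().count().max(b.chars().count());
    if longest == 0 {
        return 1.0;
    }
    let same = a.chars().zip(b.chars()).filter(|(x, y)| x == y).count();
    same as f64 / longest as f64
}
```

Because linkage is transitive, the reported minimum can fall below the threshold. With `toy_ratio` and a threshold of 0.5, the strings "aaaa", "aabb", and "bbbb" form one cluster through the middle string, even though "aaaa" and "bbbb" share no positions, so the cluster's minimum ratio is 0.0.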