Module ed_join::matching

Functions

calc_prefix_len

Given a set of q-grams, find the minimum prefix length such that, if all the q-grams in the prefix are mismatched, at least tau + 1 edit errors are incurred.
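
The idea can be sketched as follows. This is a minimal illustration, not the crate's actual code: the name `calc_prefix_len_sketch`, its signature, the representation of q-grams as start offsets in prefix order, and the threshold of tau + 1 edit errors are all assumptions. It uses a plain linear scan and falls back to the full length when no prefix reaches the threshold.

```rust
/// Hypothetical sketch: `positions[i]` is the start offset of the i-th q-gram
/// in prefix order (for example, rarest q-gram first). Assumes q >= 1.
fn calc_prefix_len_sketch(positions: &[usize], q: usize, tau: usize) -> usize {
    // Minimum number of edited positions needed to touch every q-gram in
    // `starts`, where an edit at position p touches q-grams covering p.
    fn greedy_errors(starts: &[usize], q: usize) -> usize {
        let mut sorted = starts.to_vec();
        sorted.sort_unstable();
        let mut count = 0;
        let mut covered_end: Option<usize> = None;
        for &s in &sorted {
            // A q-gram starting past the last edited position needs a new edit.
            if covered_end.map_or(true, |end| s > end) {
                count += 1;
                // Edit the last character of this q-gram.
                covered_end = Some(s + q.saturating_sub(1));
            }
        }
        count
    }

    // Grow the prefix until mismatching all of it costs at least tau + 1 edits.
    for len in 1..=positions.len() {
        if greedy_errors(&positions[..len], q) >= tau + 1 {
            return len;
        }
    }
    // No prefix reaches the threshold: use every q-gram.
    positions.len()
}
```

With q-grams at offsets 10, 0, 1, 5, 20 (in prefix order), q = 3 and tau = 2, the first four q-grams are the shortest prefix that forces three edits.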

ed_join

Given two input files, doc_x and doc_y, and two parameters, q and tau, find all records in doc_y that match records in doc_x, such that each matched pair has an edit distance less than or equal to tau.
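
To make the join condition concrete, here is a naive baseline that compares every pair of records directly. It is only a sketch of the result such a join should produce, not the function's real signature or strategy: `edit_distance` and `naive_ed_join` are hypothetical names, records are passed as in-memory string slices instead of files, and the q-gram filtering controlled by q is omitted entirely.

```rust
/// Levenshtein distance over Unicode scalar values, using two DP rows.
fn edit_distance(a: &str, b: &str) -> usize {
    let a: Vec<char> = a.chars().collect();
    let b: Vec<char> = b.chars().collect();
    // prev[j] = distance between the first i-1 chars of `a` and first j of `b`.
    let mut prev: Vec<usize> = (0..=b.len()).collect();
    for i in 1..=a.len() {
        // cur[0] = i: deleting the first i characters of `a`.
        let mut cur = vec![i; b.len() + 1];
        for j in 1..=b.len() {
            let substitute = prev[j - 1] + usize::from(a[i - 1] != b[j - 1]);
            cur[j] = substitute.min(prev[j] + 1).min(cur[j - 1] + 1);
        }
        prev = cur;
    }
    prev[b.len()]
}

/// Hypothetical quadratic baseline: returns index pairs (i, j) such that
/// `xs[i]` and `ys[j]` are within edit distance `tau`.
fn naive_ed_join(xs: &[&str], ys: &[&str], tau: usize) -> Vec<(usize, usize)> {
    let mut pairs = Vec::new();
    for (i, x) in xs.iter().enumerate() {
        for (j, y) in ys.iter().enumerate() {
            if edit_distance(x, y) <= tau {
                pairs.push((i, j));
            }
        }
    }
    pairs
}
```

A filter-based join is expected to return the same pairs as this baseline while verifying far fewer candidates.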

min_edit_errors

Given a set of q-grams, find the minimum number of edit operations that destroy all of the q-grams.
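
One way to compute this is a greedy sweep over the q-grams sorted by position. The sketch below assumes each q-gram is identified by its start offset, that q >= 1, and that a single edit at position p destroys every q-gram covering p; the name `min_edit_errors_sketch` and its signature are illustrative, not the crate's API.

```rust
/// Hypothetical sketch: `positions` holds the start offset of each q-gram.
fn min_edit_errors_sketch(positions: &[usize], q: usize) -> usize {
    let mut sorted = positions.to_vec();
    sorted.sort_unstable();

    let mut count = 0;
    // Rightmost position edited so far, if any.
    let mut covered_end: Option<usize> = None;
    for &start in &sorted {
        // A q-gram that starts after the last edit is still intact.
        if covered_end.map_or(true, |end| start > end) {
            count += 1;
            // Editing its last character also destroys any later q-gram
            // that starts at or before that position.
            covered_end = Some(start + q.saturating_sub(1));
        }
    }
    count
}
```

For offsets 0, 1, 2, 5, 6, 10 with q = 3, edits at positions 2, 7 and 12 destroy every q-gram, so the result is 3.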