Expand description

Stringmetrics library

A library for applying text- and token- based comparison algorithms to determine the similarity of two strings or sets. The core modules are algorithms, which contains algorithms for determining closeness, and [collectors] which has helper functions for preparing anything for tokenization and comparison.

Note that this module is very much a work in progress, and it is likely that interfaces may change. Large parts of this module are still under construction. The eventual goal is to implement many of the metrics listed on this wikipedia page, generalized to any hashable datatype.

Modules

This module contains functions for applying various closeness algorithms.