[−][src]Crate spatium
SPATIUM
SPATIUM is a library for calcuting distances beetween sequences.
Algorithms:
Edit based:
References:
Distance Libraries:
Python libs:
- python textdistance
- python abydos
- python Distance
- python jellyfish
- python py_stringmatching
- python pylev
- python python-Levenshtein
- python pyxDamerauLevenshtein
Large libraries
- https://github.com/rockymadden/stringmetric
- https://github.com/smashedtoatoms/the_fuzz
- https://github.com/howardjp/phonics
- https://github.com/MrPowers/spark-stringmetric
- https://github.com/n7v/phonetic
- https://github.com/wolfpaulus/phonetic-alg-compare
- https://github.com/kolipass/phonetic-lib
- https://github.com/DaveJPoole/Phonetics
- https://github.com/keerthikoneru/Phonetic-Matching-Tool-Kit-with-State-of-the-Art-Meta-Soundex-Algorithm
Smaller libraries with gimmick algorithms
- https://github.com/eldersantos/phonix (Match Rating)
- https://github.com/johnjansen/text (Porter Stemming and White similarity)
- https://github.com/mlobl/string-metrics (pure string metrics)
- https://github.com/Cortexelus/anaphone-solver (CMU dictionary)
- https://github.com/CasualSuperman/phonetics (SIFT3)
- https://github.com/ahysing/norphone (Norwegian)
- https://github.com/outhwest/pysounds (literary hardness)
Custom algorithms
- https://github.com/ticki/eudex (probability based)
- https://github.com/lingz/fast_fuzzy_search
- https://github.com/jze/phonet4java (Phonet)
- https://github.com/ViniciusMRosa/phonetic-algorithms (Portugese)
- https://github.com/flezzfx/phonetizz (Phonetizz)
- https://github.com/olsgaard/phonetic_search (Phonix)
Also for long lists
-
https://github.com/ecomp-shONgit/string-distance
-
https://github.com/vickumar1981/stringdistance
-
https://github.com/luozhouyang/python-string-similarity
-
https://github.com/markvanderloo/stringdist
-
https://github.com/feature23/StringSimilarity.NET
-
https://github.com/matthieugomez/StringDistances.jl
-
https://github.com/dexyk/stringosim
-
https://github.com/StefH/SimMetrics.Net
-
https://github.com/ColinFay/tidystringdist
-
https://github.com/miku/stardust
-
https://github.com/tdebatty/java-string-similarity
-
And some of the repos have special algorithms
-
https://github.com/timoxley/sift (SIFT1)
-
https://github.com/PrismaPhonic/sift4-rs (SIFT4)
-
https://github.com/auguscl/Dist4Seq (BSMpre/PURDUEpre)
-
https://github.com/RThevenoux/ipa-distance (IPA linguistic distance)
-
https://github.com/ferreirafabio/minimum-edit-distance-py (Wagner-Fischer)
-
https://github.com/rahulpedduri/LCS (LCS)
-
https://github.com/NickRimmer/StringCompare (Tanimoto coefficient)
-
https://github.com/xrash/smetrics (Ukkonen)
-
https://github.com/hakanozbay/ozbay-metric (Longest Common Substring Distance)
-
https://github.com/mbrlabs/distance (SIFT3)
-
https://github.com/ychantit/fuzzymatch_hiveUDF (Ngram)
-
https://github.com/MailOnline/s-metric (naive brute force)
-
https://github.com/a1trl9/flatwhite (ged/led)
-
https://github.com/winkjs/wink-distance (vector-based distances)
-
https://github.com/ianozsvald/string_distance_metrics (Title string lengths and Uni/bi/trigram distances)
-
https://github.com/accidental-bebop/BkStringMatch (q-gram distance)
-
https://github.com/Andreshk/ApproximateStringMatching ("unique")
-
https://github.com/blester125/string_distance (brew and ratcliff_obershelft)
Modules
edit_based | Edit Based algorithms |
error | Library error class |