polyphon 1.0.2

Implementation of phonetic Polyphon algorithm (PyO3 core wrapper)

About

This package is an efficient implementation of the Polyphon phonetic algorithm.

Original paper: «Polyphon: An Algorithm for Phonetic String Matching in Russian Language».

Authors: Viacheslav V. Paramonov, Alexey O. Shigarov, Gennady M. Ruzhnikov, Polina V. Belykh.

We propose a new phonetic algorithm for string matching in the Russian language without transliteration from Cyrillic to Latin characters. It is based on the rules of sound formation in the Russian language.

Usage

Add the dependency:

$ pip install polyphon

And then use:

from polyphon import encode

print(encode("литие")) # лата

Note: encode works on a single word and removes any non-Russian characters (including spaces). If you want to encode multiple words, split them first and encode each separately.
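The split-then-encode approach from the note above could look roughly like the sketch below. The helper name encode_phrase is hypothetical and not part of polyphon. The encoder is passed in as an argument so the helper does not depend on how the package is installed. Skipping empty results is an assumption about what encode returns for tokens with no Russian letters.

```python
from typing import Callable, List


def encode_phrase(text: str, encode_word: Callable[[str], str]) -> List[str]:
    """Split text on whitespace and encode each word separately.

    encode_phrase is a hypothetical helper, not part of polyphon.
    encode_word is any single-word encoder, e.g. polyphon.encode.
    """
    codes = []
    for word in text.split():
        code = encode_word(word)
        # Assumption: a token without Russian letters may encode to an
        # empty string, so empty results are skipped.
        if code:
            codes.append(code)
    return codes
```

With the package installed, this would be called as encode_phrase("some text", encode), where encode is imported from polyphon as shown earlier. Splitting on whitespace only is a deliberate simplification; hyphenated words stay as one token and are passed to the encoder as-is.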

This package is a wrapper around a Rust implementation. The repo is here.