textprep 0.1.0

Text preprocessing primitives: normalization, tokenization, and fast keyword matching.
Documentation

textprep

Text preprocessing primitives for the representational stack.

Provides Unicode normalization, case folding, diacritics stripping, tokenization, and fast keyword matching.