Skip to main content

Crate rehuman

Crate rehuman 

Source
Expand description

rehuman — Unicode‑safe text cleaning & typographic normalization.

Structs§

CleaningOptions
Configuration for cleaning.
CleaningOptionsBuilder
Builder for CleaningOptions.
CleaningResult
Result of a text cleaning operation.
CleaningStats
Detailed statistics about cleaning operations.
StreamCleaner
Incremental cleaner that processes text in newline-delimited chunks.
StreamSummary
Summary of cumulative streaming cleanup work.
TextCleaner
Main cleaner.

Enums§

CleaningError
Errors produced by fallible cleaning APIs.
EmojiPolicy
Policy for emoji handling when keyboard_only is enabled.
LineEndingStyle
Line ending styles.
NonAsciiPolicy
Policy for handling non-ASCII graphemes in keyboard_only mode.
UnicodeNormalizationMode
Unicode normalization modes.

Functions§

clean
Convenience: clean with default options.
humanize
Convenience: clean with the humanize preset.
is_emoji
Emoji detection via the Unicode Emoji binary property.
is_extended_keyboard_char
Curated non-ASCII characters allowed in extended keyboard mode.
is_hidden_char
Hidden/format-like characters defined by Default_Ignorable_Code_Point (DI).
is_keyboard_ascii
ASCII keyboard (US) characters + whitespace controls typically produced by keyboards.