Text normalization and hashing utilities for ISCC code generation.
Provides text cleaning, trimming, collapsing, and BLAKE3 multihash functions
ported from iscc-core code_meta.py and utils.py.
Functions
- text_clean - Clean and normalize text for display.
- text_collapse - Normalize and simplify text for similarity hashing.
- text_remove_newlines - Remove newlines and collapse whitespace to single spaces.
- text_trim - Trim text so its UTF-8 encoded size does not exceed nbytes.
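
The two simplest behaviors described above, byte-limited trimming and newline removal, can be illustrated with a minimal sketch. This is not the crate's actual implementation: the function names `trim_to_bytes` and `collapse_whitespace` are hypothetical, and the sketch assumes that trimming cuts at the nearest character boundary at or below the limit and then strips surrounding whitespace.

```rust
/// Hypothetical helper (not part of this module's API): truncate `text` to at
/// most `nbytes` UTF-8 bytes without splitting a multi-byte character, then
/// strip leading and trailing whitespace.
fn trim_to_bytes(text: &str, nbytes: usize) -> String {
    if text.len() <= nbytes {
        return text.trim().to_string();
    }
    // Walk back from the byte limit to the nearest character boundary.
    // Index 0 is always a boundary, so the loop terminates.
    let mut end = nbytes;
    while !text.is_char_boundary(end) {
        end -= 1;
    }
    text[..end].trim().to_string()
}

/// Hypothetical helper (not part of this module's API): replace every run of
/// whitespace, including newlines, with a single space.
fn collapse_whitespace(text: &str) -> String {
    text.split_whitespace().collect::<Vec<_>>().join(" ")
}
```

Cutting on a character boundary matters because slicing a `&str` in the middle of a multi-byte sequence panics in Rust. For example, `trim_to_bytes("héllo", 2)` keeps only `h`, since the two-byte `é` would straddle the limit.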