Content normalization for deterministic hashing.
Normalization ensures that semantically equivalent content produces identical hashes, regardless of superficial differences like whitespace or Unicode representation.
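To illustrate the idea, here is a minimal sketch using only the standard library. It is not this module's implementation: `normalize_for_hash` and `stable_hash` are hypothetical helpers, the sketch covers only line endings and trailing whitespace (Unicode normalization is omitted because it needs an external crate), and `DefaultHasher` stands in for whatever hash the crate actually uses.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

/// Hypothetical helper: convert CRLF and lone CR line endings to LF,
/// then strip trailing spaces and tabs from every line.
fn normalize_for_hash(input: &str) -> String {
    let unified = input.replace("\r\n", "\n").replace('\r', "\n");
    unified
        .split('\n')
        .map(|line| line.trim_end_matches(|c: char| c == ' ' || c == '\t'))
        .collect::<Vec<_>>()
        .join("\n")
}

/// Hypothetical helper: hash the normalized form rather than the raw bytes,
/// so inputs that differ only superficially hash to the same value.
fn stable_hash(input: &str) -> u64 {
    let mut hasher = DefaultHasher::new();
    normalize_for_hash(input).hash(&mut hasher);
    hasher.finish()
}
```

With these helpers, `stable_hash("a\r\nb")` and `stable_hash("a\nb")` agree, while inputs with different text do not. `DefaultHasher` is not guaranteed to be stable across Rust releases, so a persistent hash would likely use a fixed algorithm instead.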
Structs
- NormalizationConfig - Normalization configuration
Enums
- LineEndingNorm - Line ending normalization
- UnicodeForm - Unicode normalization form (per TR15)
- WhitespaceNorm - Whitespace normalization strategy
Functions
- canonical_json - Canonical JSON serialization (RFC 8785).
- is_cjk_character - Check if a character is CJK (Chinese, Japanese, Korean)
- normalize_content - Normalize content for hashing.
- normalize_text - Normalize a text string according to configuration.
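As a rough illustration of what a CJK check such as is_cjk_character might involve, the sketch below tests a character against a few well-known Unicode blocks. The function name `looks_cjk` and the choice of blocks are assumptions made for this example; the ranges the crate actually checks are not shown on this page and may be broader.

```rust
/// Hypothetical check: returns true when `c` falls in one of a few common
/// CJK-related Unicode blocks. The block list is an assumption for this
/// sketch, not the crate's definition.
fn looks_cjk(c: char) -> bool {
    matches!(
        c,
        '\u{4E00}'..='\u{9FFF}'     // CJK Unified Ideographs
            | '\u{3400}'..='\u{4DBF}' // CJK Unified Ideographs Extension A
            | '\u{3040}'..='\u{309F}' // Hiragana
            | '\u{30A0}'..='\u{30FF}' // Katakana
            | '\u{AC00}'..='\u{D7AF}' // Hangul Syllables
    )
}
```

A range-based check like this works on single `char` values and needs no external crates, at the cost of missing blocks that are not listed explicitly.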