Expand description
§RFC9839-rs
A rust implementation of RFC9839 to test for problematic Unicode code points
Modules§
- control
- Characters which are part of the ASCII control character range or extended ASCII
Structs§
- Unicode
Assignables - Unicode code points that are not problematic. As specified by RFC9839.
- Unicode
Scalars - Any Unicode code point except high-surrogate and low-surrogate code points. As specified by Unicode 16
- XmlCharacters
- Unicode code points that excludes surrogates, legacy C0 controls, and the noncharacters U+FFFE and U+FFFF. As specified by the XML 1.0 specification.
Functions§
- is_
noncharacter - Checks if the value is outside the range of Unicode code points
- is_
unicode_ surrotate - Check if the value is either a low or high surrogate these characters should not be encoded as part of a UTF-8 stream.