Module basic_text_internals::unicode
Constants
ASCII BEL.
ZERO WIDTH NO-BREAK SPACE, also known as the byte-order mark, or BOM
ASCII CAN.
COMBINING GRAPHEME JOINER
ASCII DEL, which is not what’s generated by the “delete” key on the keyboard
ASCII ESC, known as ‘\e’ in some contexts.
ASCII FF, known as ‘\f’ in some contexts.
LINE SEPARATOR
The size, in bytes, of the longest UTF-8 encoding of a Unicode scalar value. RFC 2279 allowed longer encodings, but it is obsoleted by RFC 3629, which does not. This limit is also documented in the relevant section of Rust’s documentation.
EBCDIC NEXT LINE, which is treated like generic whitespace.
The minimum size of a buffer needed to perform NFC normalization, and thus the minimum size needed to pass to TextReader’s read.
OBJECT REPLACEMENT CHARACTER
PARAGRAPH SEPARATOR
REPLACEMENT CHARACTER
ASCII SUB.
WORD JOINER
ZERO WIDTH JOINER
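
The UTF-8 size limit described above can be checked with the standard library alone. The following is a minimal sketch, not this module's code: the names `LONGEST_UTF8_ENCODING` and `utf8_len` are hypothetical and introduced only for illustration, and the value 4 is the limit set by RFC 3629. The sample code points are the standard Unicode values for some of the characters listed on this page.

```rust
/// Hypothetical local name for illustration; it is not taken from this module.
/// RFC 3629 limits a UTF-8 encoded scalar value to 4 bytes.
const LONGEST_UTF8_ENCODING: usize = 4;

/// Returns the number of bytes `c` occupies when encoded as UTF-8.
fn utf8_len(c: char) -> usize {
    // A buffer of the maximum size is always large enough for one scalar value.
    let mut buf = [0u8; LONGEST_UTF8_ENCODING];
    c.encode_utf8(&mut buf).len()
}

fn main() {
    // Standard code points for some of the characters described on this page.
    let samples = [
        ('\u{7}', "ASCII BEL"),
        ('\u{85}', "NEXT LINE"),
        ('\u{2028}', "LINE SEPARATOR"),
        ('\u{FEFF}', "ZERO WIDTH NO-BREAK SPACE (BOM)"),
        ('\u{FFFD}', "REPLACEMENT CHARACTER"),
        (char::MAX, "largest scalar value, U+10FFFF"),
    ];
    for &(c, name) in samples.iter() {
        println!("U+{:04X} {}: {} byte(s)", c as u32, name, utf8_len(c));
    }
}
```

Sizing the scratch buffer from the constant rather than a literal keeps the encoding call panic-free for every `char`, since no scalar value needs more than four bytes.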