Module basic_text_internals::unicode
Constants
ASCII BEL.
ZERO WIDTH NO-BREAK SPACE, also known as the byte-order mark, or BOM
ASCII CAN.
COMBINING GRAPHEME JOINER
ASCII DEL, which is not what’s generated by the “delete” key on the keyboard.
ASCII ESC, known as ‘\e’ in some contexts.
ASCII FF, known as ‘\f’ in some contexts.
LINE SEPARATOR
The size of the longest UTF-8 scalar value encoding. Note that even though
RFC 2279 allowed longer encodings, it is obsoleted by RFC 3629, which
does not. This limit is also documented in the relevant section of Rust’s
documentation.
EBCDIC NEXT LINE, which is treated like generic whitespace.
The minimum size of a buffer needed to perform NFC normalization, and thus
the minimum size needed to pass to TextReader’s read.
OBJECT REPLACEMENT CHARACTER
PARAGRAPH SEPARATOR
REPLACEMENT CHARACTER
ASCII SUB.
WORD JOINER
ZERO WIDTH JOINER
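
As a rough illustration of the entries above, the following sketch pairs each named character with its code point as assigned by the Unicode Standard, and checks the UTF-8 size limit using only the standard library. The names CODE_POINTS and longest_utf8_encoding are hypothetical and chosen for this example; they are not taken from this module, whose own constant names and values are not shown in this listing.

```rust
// Hypothetical table for illustration only: the descriptions mirror the
// listing above, and the code points come from the Unicode Standard,
// not from this module's source.
const CODE_POINTS: [(&str, char); 15] = [
    ("ASCII BEL", '\u{7}'),
    ("ZERO WIDTH NO-BREAK SPACE (BOM)", '\u{FEFF}'),
    ("ASCII CAN", '\u{18}'),
    ("COMBINING GRAPHEME JOINER", '\u{34F}'),
    ("ASCII DEL", '\u{7F}'),
    ("ASCII ESC", '\u{1B}'),
    ("ASCII FF", '\u{C}'),
    ("LINE SEPARATOR", '\u{2028}'),
    ("NEXT LINE (NEL)", '\u{85}'),
    ("OBJECT REPLACEMENT CHARACTER", '\u{FFFC}'),
    ("PARAGRAPH SEPARATOR", '\u{2029}'),
    ("REPLACEMENT CHARACTER", '\u{FFFD}'),
    ("ASCII SUB", '\u{1A}'),
    ("WORD JOINER", '\u{2060}'),
    ("ZERO WIDTH JOINER", '\u{200D}'),
];

/// Length in bytes of the longest UTF-8 encoding of any Unicode scalar
/// value, computed from the largest `char` rather than hard-coded.
fn longest_utf8_encoding() -> usize {
    char::MAX.len_utf8()
}

fn main() {
    // Print each character's code point and its encoded length in UTF-8.
    for (name, c) in CODE_POINTS.iter() {
        println!("U+{:04X} {} ({} byte(s) in UTF-8)", *c as u32, name, c.len_utf8());
    }
    println!("longest UTF-8 encoding: {} bytes", longest_utf8_encoding());
}
```

Deriving the limit from char::MAX keeps the sketch independent of any particular constant in this module while still reflecting the RFC 3629 bound mentioned above.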