Module basic_text_internals::unicode
Constants
- ASCII BEL.
- ZERO WIDTH NO-BREAK SPACE, also known as the byte-order mark, or BOM
- ASCII CAN.
- COMBINING GRAPHEME JOINER
- ASCII DEL, which is not what’s generated by the “delete” key on the keyboard
- ASCII ESC, known as ‘\e’ in some contexts.
- ASCII FF, known as ‘\f’ in some contexts.
- LINE SEPARATOR
- The size, in bytes, of the longest UTF-8 encoding of a Unicode scalar value. RFC 2279 allowed longer encodings, but it is obsoleted by RFC 3629, which does not. This limit is also documented in the relevant section of Rust’s documentation.
- EBCDIC NEXT LINE, which is treated like generic whitespace.
- The minimum size of a buffer needed to perform NFC normalization, and thus the minimum size needed to pass to TextReader’s read.
- OBJECT REPLACEMENT CHARACTER
- PARAGRAPH SEPARATOR
- REPLACEMENT CHARACTER
- ASCII SUB.
- WORD JOINER
- ZERO WIDTH JOINER
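
As a rough illustration of the entries above, the following sketch uses only the Rust standard library to check the longest UTF-8 encoding and to list the code points that the Unicode Standard assigns to the named characters. The identifiers `LONGEST_UTF8_ENCODING`, `CODE_POINTS`, `longest_scalar_encoding`, and `encoded_len` are hypothetical names chosen for this example; they are not taken from this module, whose actual constant names are not shown in the listing.

```rust
// Hypothetical name for illustration only; not this module's constant.
// RFC 3629 limits a UTF-8 encoded scalar value to four bytes.
const LONGEST_UTF8_ENCODING: usize = 4;

/// Returns the UTF-8 encoded length of the largest Unicode scalar value,
/// U+10FFFF, as computed by the standard library.
fn longest_scalar_encoding() -> usize {
    char::MAX.len_utf8()
}

/// Code points of the characters described in the list above, per the
/// Unicode Standard. The short labels are conventional abbreviations used
/// here for readability, not identifiers confirmed by this module.
const CODE_POINTS: [(&str, char); 15] = [
    ("BEL", '\u{7}'),     // ASCII BEL
    ("BOM", '\u{FEFF}'),  // ZERO WIDTH NO-BREAK SPACE (byte-order mark)
    ("CAN", '\u{18}'),    // ASCII CAN
    ("CGJ", '\u{34F}'),   // COMBINING GRAPHEME JOINER
    ("DEL", '\u{7F}'),    // ASCII DEL
    ("ESC", '\u{1B}'),    // ASCII ESC
    ("FF", '\u{C}'),      // ASCII FF
    ("LS", '\u{2028}'),   // LINE SEPARATOR
    ("NEL", '\u{85}'),    // NEXT LINE
    ("ORC", '\u{FFFC}'),  // OBJECT REPLACEMENT CHARACTER
    ("PS", '\u{2029}'),   // PARAGRAPH SEPARATOR
    ("REPL", '\u{FFFD}'), // REPLACEMENT CHARACTER
    ("SUB", '\u{1A}'),    // ASCII SUB
    ("WJ", '\u{2060}'),   // WORD JOINER
    ("ZWJ", '\u{200D}'),  // ZERO WIDTH JOINER
];

/// Encodes `c` into a buffer of the maximum size and returns how many
/// bytes the encoding actually used.
fn encoded_len(c: char) -> usize {
    let mut buf = [0u8; LONGEST_UTF8_ENCODING];
    c.encode_utf8(&mut buf).len()
}
```

A four-byte buffer is always large enough for `char::encode_utf8`, which is why a constant of this kind is useful for sizing stack buffers when encoding one scalar value at a time.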