Module basic_text_internals::unicode

Constants

ASCII BEL.

ZERO WIDTH NO-BREAK SPACE, also known as the byte-order mark (BOM).

ASCII CAN.

COMBINING GRAPHEME JOINER

ASCII DEL, which is not what the “delete” key on a keyboard generates.

ASCII ESC, known as ‘\e’ in some contexts.

ASCII FF, known as ‘\f’ in some contexts.

LINE SEPARATOR

The size in bytes of the longest UTF-8 encoding of a scalar value. RFC 2279 allowed longer encodings, but it was obsoleted by RFC 3629, which limits an encoding to four bytes. This limit is also documented in the relevant section of Rust’s documentation.

EBCDIC NEXT LINE, which is treated like generic whitespace.

The minimum size of a buffer needed to perform NFC normalization, and thus the minimum size needed to pass to TextReader’s read.

OBJECT REPLACEMENT CHARACTER

PARAGRAPH SEPARATOR

REPLACEMENT CHARACTER

ASCII SUB.

WORD JOINER

ZERO WIDTH JOINER
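
As a rough illustration of the constants described above, the following sketch pairs each description with its code point from the ASCII and Unicode standards. The constant names are not shown on this page, so every identifier here (including `MAX_UTF8_SIZE`, `strip_bom`, and `encoded_len`) is hypothetical and may not match the crate's actual items. The normalization buffer size is left out because its value is not stated here.

```rust
// Hypothetical names; only the code point values are taken from the standards.
pub const BEL: char = '\u{7}'; // ASCII BEL
pub const BOM: char = '\u{FEFF}'; // ZERO WIDTH NO-BREAK SPACE (byte-order mark)
pub const CAN: char = '\u{18}'; // ASCII CAN
pub const CGJ: char = '\u{34F}'; // COMBINING GRAPHEME JOINER
pub const DEL: char = '\u{7F}'; // ASCII DEL
pub const ESC: char = '\u{1B}'; // ASCII ESC
pub const FF: char = '\u{C}'; // ASCII FF
pub const LS: char = '\u{2028}'; // LINE SEPARATOR
pub const NEL: char = '\u{85}'; // NEXT LINE
pub const ORC: char = '\u{FFFC}'; // OBJECT REPLACEMENT CHARACTER
pub const PS: char = '\u{2029}'; // PARAGRAPH SEPARATOR
pub const REPL: char = '\u{FFFD}'; // REPLACEMENT CHARACTER
pub const SUB: char = '\u{1A}'; // ASCII SUB
pub const WJ: char = '\u{2060}'; // WORD JOINER
pub const ZWJ: char = '\u{200D}'; // ZERO WIDTH JOINER

/// Longest UTF-8 encoding of any scalar value, in bytes (RFC 3629).
pub const MAX_UTF8_SIZE: usize = 4;

/// Strips a single leading byte-order mark, if one is present.
pub fn strip_bom(s: &str) -> &str {
    s.strip_prefix(BOM).unwrap_or(s)
}

/// Encodes `c` into a worst-case-sized buffer and returns the bytes used.
pub fn encoded_len(c: char) -> usize {
    let mut buf = [0u8; MAX_UTF8_SIZE];
    c.encode_utf8(&mut buf).len()
}
```

A four-byte buffer is enough for any `char`, which is why `encoded_len` can use a fixed-size stack array: `char::MAX` (U+10FFFF) encodes to exactly four bytes, while ASCII control characters such as BEL take one.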

Functions