Constants

  • ASCII BEL.
  • ZERO WIDTH NO-BREAK SPACE, also known as the byte-order mark, or BOM
  • ASCII CAN.
  • COMBINING GRAPHEME JOINER
  • ASCII DEL, which is not what’s generated by the “delete” key on the keyboard
  • ASCII ESC, known as ‘\e’ in some contexts.
  • ASCII FF, known as ‘\f’ in some contexts.
  • LINE SEPARATOR
  • The size, in bytes, of the longest UTF-8 encoding of a Unicode scalar value. RFC 2279 allowed longer encodings, but it has been obsoleted by RFC 3629, which does not. This limit is also documented in the relevant section of Rust’s documentation.
  • EBCDIC NEXT LINE, which is treated like generic whitespace.
  • The minimum size of a buffer needed to perform NFC normalization, and thus the minimum size of a buffer that can be passed to TextReader’s read.
  • OBJECT REPLACEMENT CHARACTER
  • PARAGRAPH SEPARATOR
  • REPLACEMENT CHARACTER
  • ASCII SUB.
  • WORD JOINER
  • ZERO WIDTH JOINER
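
As a rough illustration of the entries above, the following sketch spells out the Unicode scalar values those descriptions refer to and checks the UTF-8 length limit using only the standard library. The constant names, `MAX_UTF8_LEN`, and the `encode` helper are hypothetical, chosen for this example; they are not taken from this crate, whose own identifiers do not appear in the listing.

```rust
// Hypothetical local names for illustration only; the crate's own
// identifiers are not shown in the listing above.
const BEL: char = '\u{7}'; // ASCII BEL
const BOM: char = '\u{FEFF}'; // ZERO WIDTH NO-BREAK SPACE (byte-order mark)
const CAN: char = '\u{18}'; // ASCII CAN
const CGJ: char = '\u{34F}'; // COMBINING GRAPHEME JOINER
const DEL: char = '\u{7F}'; // ASCII DEL
const ESC: char = '\u{1B}'; // ASCII ESC
const FF: char = '\u{C}'; // ASCII FF
const LS: char = '\u{2028}'; // LINE SEPARATOR
const NEL: char = '\u{85}'; // NEXT LINE
const ORC: char = '\u{FFFC}'; // OBJECT REPLACEMENT CHARACTER
const PS: char = '\u{2029}'; // PARAGRAPH SEPARATOR
const REPL: char = '\u{FFFD}'; // REPLACEMENT CHARACTER
const SUB: char = '\u{1A}'; // ASCII SUB
const WJ: char = '\u{2060}'; // WORD JOINER
const ZWJ: char = '\u{200D}'; // ZERO WIDTH JOINER

// Longest UTF-8 encoding of a scalar value under RFC 3629, in bytes.
const MAX_UTF8_LEN: usize = 4;

/// Encodes `c` into a fixed-size stack buffer and returns the buffer
/// together with the number of bytes actually used. Unused trailing
/// bytes are left as zero.
fn encode(c: char) -> ([u8; MAX_UTF8_LEN], usize) {
    let mut buf = [0u8; MAX_UTF8_LEN];
    let len = c.encode_utf8(&mut buf).len();
    (buf, len)
}
```

Because `char::MAX` (U+10FFFF) encodes to exactly four bytes, a four-byte buffer is sufficient for any `char`, which is why a fixed-size array can be used here without bounds concerns.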

Functions