Low-level Rust lexer.
Tokens produced by this lexer are not yet ready for parsing the Rust syntax; for that, see librustc_parse::lexer, which converts this basic token stream into wide tokens used by the actual parser.
The purpose of this crate is to convert raw sources into a labeled sequence of well-known token types, so that building an actual Rust token stream is easier.
The main entity of this crate is the TokenKind enum, which represents common lexeme types.
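To make the "token kind plus size, no parsed data" shape concrete, here is a minimal, self-contained sketch in the same spirit. This is an illustrative simplification, not the real rustc_lexer implementation: the TokenKind variants and classification rules below are hypothetical stand-ins for the crate's much richer ones.

```rust
// Illustrative sketch only: a token carries its kind and its length,
// but no parsed data, mirroring the shape described above.

#[derive(Debug)]
enum TokenKind {
    Whitespace,
    Ident,
    Unknown,
}

struct Token {
    kind: TokenKind,
    len: usize, // how many chars the token spans
}

// Classify the first token of a non-empty input by scanning a run of
// characters that share the first character's class.
fn first_token(input: &str) -> Token {
    let mut chars = input.chars();
    let first = chars.next().expect("non-empty input");
    let (kind, continues): (TokenKind, fn(char) -> bool) = if first.is_whitespace() {
        (TokenKind::Whitespace, char::is_whitespace)
    } else if first.is_alphabetic() || first == '_' {
        (TokenKind::Ident, |c| c.is_alphanumeric() || c == '_')
    } else {
        (TokenKind::Unknown, |_| false)
    };
    let len = 1 + chars.take_while(|&c| continues(c)).count();
    Token { kind, len }
}

fn main() {
    let t = first_token("hello world");
    assert!(matches!(t.kind, TokenKind::Ident));
    assert_eq!(t.len, 5); // "hello" spans 5 chars
}
```

A caller can lex a whole string by repeatedly taking the first token and advancing the input by `token.len`, which is exactly how the crate's tokenize-style iterator relates to first_token.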
Modules§
- unescape - Utilities for validating string and char literals and turning them into the values they represent.
Structs§
- Token - Parsed token. It doesn’t contain information about data that has been parsed, only the type of the token and its size.
Enums§
- Base - Base of numeric literal encoding according to its prefix.
- LiteralKind
- TokenKind - Enum representing common lexeme types.
Functions§
- first_token - Parses the first token from the provided input string.
- is_id_continue - True if c is valid as a non-first character of an identifier. See the Rust language reference for a formal definition of a valid identifier name.
- is_id_start - True if c is valid as a first character of an identifier. See the Rust language reference for a formal definition of a valid identifier name.
- is_whitespace - True if c is considered whitespace according to the Rust language definition. See the Rust language reference for definitions of these classes.
- strip_shebang - rustc allows files to have a shebang, e.g. “#!/usr/bin/rustrun”, but a shebang isn’t part of Rust syntax, so this function skips the line if it starts with a shebang (“#!”). The line won’t be skipped if it represents valid Rust syntax (e.g. “#![deny(missing_docs)]”).
- tokenize - Creates an iterator that produces tokens from the input string.
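The strip_shebang behavior described above can be sketched in a few lines. This is a hedged simplification under assumptions, not the real implementation: it treats any `#!` followed by `[` as an inner attribute and keeps it, whereas the actual function performs more careful checks.

```rust
// Illustrative sketch: skip a leading shebang line, but keep text that is
// actually Rust syntax (an inner attribute like "#![deny(missing_docs)]").
// Returns the number of chars to skip, or None if there is no shebang.
fn strip_shebang(input: &str) -> Option<usize> {
    let rest = input.strip_prefix("#!")?;
    // "#![...]" is an inner attribute, part of Rust syntax, not a shebang.
    if rest.trim_start().starts_with('[') {
        return None;
    }
    // Skip everything up to (but not including) the first newline.
    Some(2 + rest.find('\n').unwrap_or(rest.len()))
}

fn main() {
    // "#!/usr/bin/rustrun" is 18 chars long, all of which are skipped.
    assert_eq!(strip_shebang("#!/usr/bin/rustrun\nfn main() {}"), Some(18));
    // A valid inner attribute is not treated as a shebang.
    assert_eq!(strip_shebang("#![deny(missing_docs)]"), None);
    assert_eq!(strip_shebang("fn main() {}"), None);
}
```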