A unique feature of unsynn is that one can define a parser as a composition of other
parsers on the fly without the need to define custom structures. This is done by using the
Cons and Either types. The Cons type is used to define a parser that is a
conjunction of two to four other parsers, while the Either type is used to define a
parser that is a disjunction of two to four other parsers.
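For illustration, here is a minimal sketch of such an on-the-fly composition. It assumes the string-to-token-iterator conversion and the Parse entry point described later in this documentation, and uses the PunctAny and LiteralInteger types also documented further below:

```rust
use unsynn::*;

// A sketch only: `answer = 42` parsed as a conjunction of three parsers,
// falling back to a bare identifier. No custom struct is defined.
let mut tokens = "answer = 42".to_token_iter();
let _parsed = <Either<Cons<Ident, PunctAny<'='>, LiteralInteger>, Ident>>::parse(&mut tokens)
    .expect("one of the two alternatives matches");
```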
This module provides parsers for types that contain possibly multiple values. This
includes stdlib types like Option, Vec, Box, Rc, RefCell and types
for delimited and repeated values with numbered repeats.
For easier composition we define the Delimited type here which is a T
followed by an optional delimiting entity D. This is used by the
DelimitedVec type to parse a list of entities separated by a delimiter.
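A minimal sketch of the container parsers, under the same assumptions about the parse entry point as above:

```rust
use unsynn::*;

// A sketch only: Option<T> parses zero or one T, Vec<T> parses as many T
// as it can; both are std types acting as parsers here.
let mut tokens = "foo 1 2 3".to_token_iter();
let _name = <Option<Ident>>::parse(&mut tokens).expect("Some(foo)");
let _numbers = <Vec<TokenTree>>::parse(&mut tokens).expect("the remaining tokens");
```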
This module contains the fundamental parsers. These parsers are the basic tokens from
proc_macro2 and a few other ones defined by unsynn. These are the terminal entities when
parsing tokens. Being able to parse TokenTree and TokenStream allows one to parse
opaque entities where internal details are left out. The Cached type is used to cache
the string representation of the parsed entity. The Nothing type is used to match
without consuming any tokens. The Except type is used to match when the next token
does not match the given type. The EndOfStream type is used to match the end of the
stream when no tokens are left. The HiddenState type is used to hold additional
information that is not part of the parsed syntax.
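A minimal sketch of the terminal parsers, using the same assumed entry points as above:

```rust
use unsynn::*;

// A sketch only: Nothing matches without consuming, TokenTree consumes the
// whole parenthesized group as one opaque tree, EndOfStream requires that
// no tokens are left.
let mut tokens = "( a b c )".to_token_iter();
let _nothing = Nothing::parse(&mut tokens).expect("always matches");
let _group = TokenTree::parse(&mut tokens).expect("one opaque token tree");
let _end = EndOfStream::parse(&mut tokens).expect("no tokens left");
```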
Groups are a way to group tokens together. They are used to represent the contents between
(), {}, [] or no delimiters at all. This module provides parser implementations for
opaque group types with defined delimiters and the GroupContaining types that parse the
surrounding delimiters and the content of a group type.
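A minimal sketch; the concrete name ParenthesisGroupContaining is an assumption for one of the delimiter-specific GroupContaining types:

```rust
use unsynn::*;

// A sketch only: parse the surrounding parentheses and their content in
// one go; the content type is itself an arbitrary parser.
let mut tokens = "( foo bar )".to_token_iter();
let _group = <ParenthesisGroupContaining<Vec<Ident>>>::parse(&mut tokens)
    .expect("a parenthesized list of identifiers");
```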
This module provides a set of literal types that can be used to parse and tokenize
literals. The literals are parsed from the token stream and can be used to represent the
parsed value. unsynn defines only simplified literals, such as integers, characters and
strings. The literals here are not full Rust syntax; full literal syntax will be defined
in the unsynn-rust crate.
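A minimal sketch, assuming LiteralInteger and LiteralString as the names of the simplified literal types described below:

```rust
use unsynn::*;

// A sketch only: a plain decimal integer and a double quoted string; these
// simplified literals are not full Rust literal syntax.
let mut tokens = r#"42 "hello""#.to_token_iter();
let _number = LiteralInteger::parse(&mut tokens).expect("a plain decimal integer");
let _string = LiteralString::parse(&mut tokens).expect("a double quoted string");
```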
This module contains types for punctuation tokens. These are used to represent single and
multi-character punctuation tokens. For single-character punctuation tokens, there are
the PunctAny, PunctAlone and PunctJoint types.
Combined punctuation tokens are represented by Operator. The operator! macro can be
used to define custom operators.
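A minimal sketch of the three single-character punct types, under the same assumptions about the parse entry point as above:

```rust
use unsynn::*;

// A sketch only: in `==` the first `=` has Spacing::Joint and the second
// Spacing::Alone, so the three punct types line up as shown.
let mut tokens = "+ ==".to_token_iter();
let _plus = <PunctAny<'+'>>::parse(&mut tokens).expect("any spacing");
let _joint = <PunctJoint<'='>>::parse(&mut tokens).expect("joined to the next punct");
let _alone = <PunctAlone<'='>>::parse(&mut tokens).expect("not joined to anything");
```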
Getting the underlying string is expensive, as it always allocates a new String.
This type caches the string representation of a given entity. Note that this is
only reliable for fundamental entities that represent a single token. Spacing between
composed tokens is not stable and should be considered informal only.
This is used when one wants to parse a list of entities separated by delimiters. The
delimiter is optional and can be None, e.g. when the entity is the last in the
list. Usually the delimiter will be some simple punctuation token, but it is not limited
to that.
Since the delimiter in Delimited<T,D> is optional, a Vec<Delimited<T,D>> would parse
consecutive values even without delimiters. DelimitedVec<T,D> will stop parsing after
the first value without a delimiter.
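A minimal sketch of that stopping behavior:

```rust
use unsynn::*;

// A sketch only: in `a, b c` the DelimitedVec stops after `b` because it
// has no trailing delimiter, so `c` stays in the stream for the next parser.
let mut tokens = "a, b c".to_token_iter();
let _list = <DelimitedVec<Ident, PunctAny<','>>>::parse(&mut tokens).expect("a and b");
let _rest = Ident::parse(&mut tokens).expect("c was not consumed");
```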
Succeeds when the next token matches T. The token will be removed from the stream but not stored.
Consequently the ToTokens implementations will panic with a message that it cannot be emitted.
This can only be used when a token should be present but not stored and never emitted.
Sometimes one wants to compose types or create structures for unsynn that have members that
are not part of the parsed syntax but add some additional information. This struct can be
used to hold such members while still using the Parser and ToTokens trait
implementations automatically generated by the unsynn!{} macro or composition syntax.
HiddenState will not consume any tokens when parsing and will not emit any tokens when
generating a TokenStream. On parsing it is initialized with a default value. It has
Deref and DerefMut implemented to access the inner value.
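A minimal sketch, with the unsynn!{} invocation shown in simplified, assumed form:

```rust
use unsynn::*;

// A sketch only: `count` is bookkeeping, not syntax; it is initialized
// with its Default when parsing and emits nothing when generating tokens.
unsynn! {
    struct NamedThing {
        name: Ident,
        count: HiddenState<usize>,
    }
}
```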
A Vec<T> that is filled up to the first appearance of a terminating S. This S may
be a subset of T, thus parsing becomes lazy. This is the same as
Cons<Vec<Cons<Except<S>,T>>,S> but more convenient and efficient.
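A minimal sketch, assuming the LazyVec<T, S> parameter order implied by the description above:

```rust
use unsynn::*;

// A sketch only: collect token trees lazily up to the terminating `;`;
// everything after the terminator stays in the stream.
let mut tokens = "a b c ; rest".to_token_iter();
let _upto = <LazyVec<TokenTree, PunctAny<';'>>>::parse(&mut tokens)
    .expect("`a b c` up to and including the `;`");
```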
A literal string ("hello"), byte string (b"hello"), character ('a'),
byte character (b'a'), an integer or floating point number with or without
a suffix (1, 1u8, 2.3, 2.3f32).
A simple unsigned 128 bit integer. This is the simplest form of integer parsing. Note
that only decimal integers without any other characters, signs or suffixes are supported;
this is not full Rust syntax.
A double quoted string literal ("hello"). The quotes are included in the value. Note
that this is a simplified string literal: only double quoted strings are supported, and
this is not full Rust syntax, e.g. byte and C string literals are not supported.
A unit that always matches without consuming any tokens. This is required when one wants
to parse a Repeats without a delimiter. Note that using Nothing as primary entity
in a Vec, LazyVec, DelimitedVec or Repeats will result in an infinite
loop.
Operators made from up to four ASCII punctuation characters. Unused characters default to \0.
Custom operators can be defined with the operator! macro. All but the last character are
Spacing::Joint. Attention must be paid when operators have the same prefix: the shorter
ones need to be tried first.
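A minimal sketch of a custom arrow operator built directly from the Operator type:

```rust
use unsynn::*;

// A sketch only: a two character `->` operator; the unused third and
// fourth characters default to '\0' as described above.
type Arrow = Operator<'-', '>'>;

let mut tokens = "->".to_token_iter();
let _arrow = Arrow::parse(&mut tokens).expect("`-` joint with the following `>`");
```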
Like DelimitedVec<T,D> but with a minimum and maximum (inclusive) number of elements.
Parsing will succeed when at least the minimum number of elements is reached and stop at
the maximum number. The delimiter D defaults to Nothing to parse sequences which
don’t have delimiters.
Skips over expected tokens. Will parse and consume the tokens but not store them.
Consequently the ToTokens implementations will not output any tokens.
This trait provides the user facing API to parse grammatical entities. It is implemented
for anything that implements the Parser trait. The methods here encapsulate the
iterator that is used for parsing in a transaction. This iterator is always
Copy. Instead of using a peekable iterator or implementing deeper peeking, parse clones
this iterator to make access transactional: when parsing succeeds the transaction
is committed, otherwise it is rolled back.
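A minimal sketch of this transactional behavior, assuming LiteralInteger from the literal module as the failing alternative:

```rust
use unsynn::*;

// A sketch only: the failed attempt is rolled back, so the identifier is
// still available for the second attempt.
let mut tokens = "identifier".to_token_iter();
assert!(LiteralInteger::parse(&mut tokens).is_err()); // nothing was consumed
assert!(Ident::parse(&mut tokens).is_ok());           // the same token parses here
```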
A trait for parsing a repeating T with a minimum and maximum limit.
Sometimes the number of elements to be parsed is determined at runtime, e.g. when a number
of header items needs a matching number of values.
unsynn defines its own ToTokens trait to be able to implement it for std container types.
This is similar to the ToTokens from the quote crate but adds some extra methods and is
implemented for more types. Moreover, the to_token_iter() method is the main entry point
for creating an iterator that can be used for parsing.
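A minimal sketch of the round trip from tokens to a parsed value and back to a token iterator:

```rust
use unsynn::*;

// A sketch only: anything that implements ToTokens, including a parsed
// Vec<Ident>, can be turned back into a parsing iterator with
// to_token_iter().
let mut tokens = "foo bar baz".to_token_iter();
let idents = <Vec<Ident>>::parse(&mut tokens).expect("three identifiers");
let mut roundtrip = idents.to_token_iter();
let _again = <Vec<Ident>>::parse(&mut roundtrip).expect("the same three identifiers");
```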
We track the position of the error by counting tokens. This trait is implemented for
references to shadow counted TokenIter, and for usize. The latter allows passing in a
position directly, or using usize::MAX in case no position data is available (which will
make this error the final one when upgrading).
Type alias for the iterator type we use for parsing. This Iterator is Clone and produces
&TokenTree. The shadow counter counts tokens in the background to track progress which
is used to keep the error that made the most progress in disjunctive parsers.