Syn is a parsing library for parsing a stream of Rust tokens into a syntax tree of Rust source code.
Currently this library is geared toward the custom derive use case but contains some APIs that may be useful for Rust procedural macros more generally.
- Data structures: Syn provides a complete syntax tree that can represent any valid Rust source code. The syntax tree is rooted at `syn::File` which represents a full source file, but there are other entry points that may be useful to procedural macros including `syn::Item`, `syn::Expr` and `syn::Type`.
- Custom derives: Of particular interest to custom derives is `syn::DeriveInput` which is any of the three legal input items to a derive macro. An example below shows using this type in a library that can derive implementations of a trait of your own.
- Parser combinators: Parsing in Syn is built on a suite of public parser combinator macros that you can use for parsing any token-based syntax you dream up within a `functionlike!(...)` procedural macro. Every syntax tree node defined by Syn is individually parsable and may be used as a building block for custom syntaxes, or you may do it all yourself working from the most primitive tokens.
- Location information: Every token parsed by Syn is associated with a `Span` that tracks line and column information back to the source of that token. These spans allow a procedural macro to display detailed error messages pointing to all the right places in the user’s code. There is an example of this below.
- Feature flags: Functionality is aggressively feature gated so your procedural macros enable only what they need, and do not pay in compile time for all the rest.
Version requirement: Syn supports any compiler version back to Rust’s very first support for procedural macros in Rust 1.15.0. Some features especially around error reporting are only available in newer compilers or on the nightly channel.
§Example of a custom derive
The canonical custom derive using Syn looks like this. We write an ordinary Rust function tagged with a `proc_macro_derive` attribute and the name of the trait we are deriving. Any time that derive appears in the user’s code, the Rust compiler passes their data structure as tokens into our macro. We get to execute arbitrary Rust code to figure out what to do with those tokens, then hand some tokens back to the compiler to compile into the user’s crate.
[dependencies]
syn = "0.12"
quote = "0.4"

[lib]
proc-macro = true

extern crate proc_macro;
extern crate syn;

#[macro_use]
extern crate quote;

use proc_macro::TokenStream;
use syn::DeriveInput;

#[proc_macro_derive(MyMacro)]
pub fn my_macro(input: TokenStream) -> TokenStream {
    // Parse the input tokens into a syntax tree
    let input: DeriveInput = syn::parse(input).unwrap();

    // Build the output, possibly using quasi-quotation
    let expanded = quote! {
        // ...
    };

    // Hand the output tokens back to the compiler
    expanded.into()
}
The `heapsize` example directory shows a complete working Macros 1.1 implementation of a custom derive. It works on any Rust compiler >=1.15.0. The example derives a `HeapSize` trait which computes an estimate of the amount of heap memory owned by a value.
pub trait HeapSize {
    /// Total number of bytes of heap memory owned by `self`.
    fn heap_size_of_children(&self) -> usize;
}
The custom derive allows users to write `#[derive(HeapSize)]` on data structures in their program.
#[derive(HeapSize)]
struct Demo<'a, T: ?Sized> {
    a: Box<T>,
    b: u8,
    c: &'a str,
    d: String,
}
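To make the effect of the derive concrete, the following is a hand-written sketch of roughly what an expansion for `Demo` could look like. It is not the output of the `heapsize` example. The leaf impls for `u8`, `String`, `Box<T>` and `&T` are assumptions made for this illustration, as is the choice to sum the fields one by one.

```rust
use std::mem;

pub trait HeapSize {
    /// Total number of bytes of heap memory owned by `self`.
    fn heap_size_of_children(&self) -> usize;
}

// Assumed leaf impls: a plain integer owns no heap memory.
impl HeapSize for u8 {
    fn heap_size_of_children(&self) -> usize {
        0
    }
}

// Assumed: a String owns its allocated buffer.
impl HeapSize for String {
    fn heap_size_of_children(&self) -> usize {
        self.capacity()
    }
}

// Assumed: a Box owns the pointee plus whatever the pointee owns.
impl<T: ?Sized + HeapSize> HeapSize for Box<T> {
    fn heap_size_of_children(&self) -> usize {
        mem::size_of_val(&**self) + (**self).heap_size_of_children()
    }
}

// Assumed: a shared reference borrows its target and owns nothing.
impl<'a, T: ?Sized> HeapSize for &'a T {
    fn heap_size_of_children(&self) -> usize {
        0
    }
}

pub struct Demo<'a, T: ?Sized> {
    pub a: Box<T>,
    pub b: u8,
    pub c: &'a str,
    pub d: String,
}

// Hand-written stand-in for what a derive might generate:
// add a `HeapSize` bound to each type parameter and sum over the fields.
impl<'a, T: ?Sized + HeapSize> HeapSize for Demo<'a, T> {
    fn heap_size_of_children(&self) -> usize {
        self.a.heap_size_of_children()
            + self.b.heap_size_of_children()
            + self.c.heap_size_of_children()
            + self.d.heap_size_of_children()
    }
}
```

With these impls, a `Demo` holding a `Box<u8>` and a `String` reports one byte for the boxed value plus the capacity of the string, while the `u8` and `&str` fields contribute nothing.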
§Spans and error reporting
The `heapsize2` example directory is an extension of the `heapsize` example that demonstrates some of the hygiene and error reporting properties of Macros 2.0. This example currently requires a nightly Rust compiler >=1.24.0-nightly but we are working to stabilize all of the APIs involved.
The token-based procedural macro API provides great control over where the compiler’s error messages are displayed in user code. Consider the error the user sees if one of their field types does not implement `HeapSize`.
#[derive(HeapSize)]
struct Broken {
    ok: String,
    bad: std::thread::Thread,
}
In the Macros 1.1 string-based procedural macro world, the resulting error would point unhelpfully to the invocation of the derive macro and not to the actual problematic field.
error[E0599]: no method named `heap_size_of_children` found for type `std::thread::Thread` in the current scope
 --> src/main.rs:4:10
  |
4 | #[derive(HeapSize)]
  |          ^^^^^^^^
By tracking span information all the way through the expansion of a procedural macro as shown in the `heapsize2` example, token-based macros in Syn are able to trigger errors that directly pinpoint the source of the problem.
error[E0277]: the trait bound `std::thread::Thread: HeapSize` is not satisfied
 --> src/main.rs:7:5
  |
7 |     bad: std::thread::Thread,
  |     ^^^^^^^^^^^^^^^^^^^^^^^^ the trait `HeapSize` is not implemented for `Thread`
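The idea of carrying a location with every token can be illustrated with a small standalone model. This is only a toy: the `Span`, `Token`, `tokenize` and `error_at` names below are hypothetical, the tokenizer just splits on whitespace, and columns are byte offsets (so ASCII input is assumed). It is not how `proc_macro::Span` or Syn is implemented.

```rust
/// Toy source location, 1-based like compiler diagnostics (hypothetical type).
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct Span {
    pub line: usize,
    pub column: usize,
}

/// A token together with the place it came from.
#[derive(Debug, PartialEq)]
pub struct Token<'a> {
    pub text: &'a str,
    pub span: Span,
}

fn make_token(line: &str, line_idx: usize, start: usize, end: usize) -> Token<'_> {
    Token {
        text: &line[start..end],
        span: Span { line: line_idx + 1, column: start + 1 },
    }
}

/// Split `source` into whitespace-separated tokens, recording where each one starts.
pub fn tokenize(source: &str) -> Vec<Token<'_>> {
    let mut tokens = Vec::new();
    for (line_idx, line) in source.lines().enumerate() {
        let mut start = None;
        for (byte_idx, ch) in line.char_indices() {
            match (ch.is_whitespace(), start) {
                // First non-whitespace character: a token begins here.
                (false, None) => start = Some(byte_idx),
                // Whitespace after a token: the token ends here.
                (true, Some(s)) => {
                    tokens.push(make_token(line, line_idx, s, byte_idx));
                    start = None;
                }
                _ => {}
            }
        }
        // A token may run to the end of the line.
        if let Some(s) = start {
            tokens.push(make_token(line, line_idx, s, line.len()));
        }
    }
    tokens
}

/// Format a diagnostic that points at the token's own location.
pub fn error_at(file: &str, token: &Token<'_>, message: &str) -> String {
    format!(
        "error: {}\n --> {}:{}:{}",
        message, file, token.span.line, token.span.column
    )
}
```

Because the location travels with the token, a diagnostic raised late in processing can still name the line and column of the offending input rather than the place where processing started.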
§Parsing a custom syntax using combinators
The `lazy-static` example directory shows the implementation of a `functionlike!(...)` procedural macro in which the input tokens are parsed using `nom`-style parser combinators.
The example reimplements the popular `lazy_static` crate from crates.io as a procedural macro.
lazy_static! {
    static ref USERNAME: Regex = Regex::new("^[a-z0-9_-]{3,16}$").unwrap();
}
The implementation shows how to trigger custom warnings and error messages on the macro input.
warning: come on, pick a more creative name
  --> src/main.rs:10:16
   |
10 |     static ref FOO: String = "lazy_static".to_owned();
   |                ^^^
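For readers new to the combinator style, here is a minimal sketch of the idea in plain Rust, without Syn's macros. Everything in it is an assumption for illustration: tokens are modelled as string slices, the `token`, `option` and `many0` functions only mimic the behaviour described for the macros of the same name, and the grammar `[pub] static ref NAME ;` is a simplified stand-in for the real `lazy_static!` input.

```rust
/// Result of a parser: the parsed value and the remaining tokens, or `None` on failure.
pub type PResult<'a, T> = Option<(T, &'a [&'a str])>;

/// Succeed only if the next token equals `expected`.
pub fn token<'a>(input: &'a [&'a str], expected: &str) -> PResult<'a, &'a str> {
    match input.split_first() {
        Some((first, rest)) if *first == expected => Some((*first, rest)),
        _ => None,
    }
}

/// Turn a failed parse into `None` (consuming nothing) and a success into `Some`.
pub fn option<'a, T>(
    input: &'a [&'a str],
    parser: impl Fn(&'a [&'a str]) -> PResult<'a, T>,
) -> PResult<'a, Option<T>> {
    match parser(input) {
        Some((value, rest)) => Some((Some(value), rest)),
        None => Some((None, input)),
    }
}

/// Apply `parser` zero or more times, collecting the results; never fails.
pub fn many0<'a, T>(
    mut input: &'a [&'a str],
    parser: impl Fn(&'a [&'a str]) -> PResult<'a, T>,
) -> PResult<'a, Vec<T>> {
    let mut out = Vec::new();
    while let Some((value, rest)) = parser(input) {
        // Stop if the parser succeeded without consuming input, to avoid looping forever.
        if rest.len() == input.len() {
            break;
        }
        out.push(value);
        input = rest;
    }
    Some((out, input))
}

/// Parse one `[pub] static ref NAME ;` declaration (hypothetical, simplified grammar).
/// Returns whether `pub` was present and the declared name.
pub fn parse_declaration<'a>(input: &'a [&'a str]) -> PResult<'a, (bool, &'a str)> {
    let (vis, input) = option(input, |i| token(i, "pub"))?;
    let (_, input) = token(input, "static")?;
    let (_, input) = token(input, "ref")?;
    let (name, input) = input.split_first()?;
    let (_, rest) = token(input, ";")?;
    Some(((vis.is_some(), *name), rest))
}
```

Small parsers are composed into larger ones: `parse_declaration` is built from `token` and `option`, and a whole block of declarations is just `many0` applied to `parse_declaration`.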
§Debugging
When developing a procedural macro it can be helpful to look at what the generated code looks like. Use `cargo rustc -- -Zunstable-options --pretty=expanded` or the `cargo expand` subcommand.
To show the expanded code for some crate that uses your procedural macro, run `cargo expand` from that crate. To show the expanded code for one of your own test cases, run `cargo expand --test the_test_case` where the last argument is the name of the test file without the `.rs` extension.
This write-up by Brandon W Maister discusses debugging in more detail: Debugging Rust’s new Custom Derive system.
§Optional features
Syn puts a lot of functionality behind optional features in order to optimize compile time for the most common use cases. The following features are available.
- `derive` (enabled by default): Data structures for representing the possible input to a custom derive, including structs and enums and types.
- `full`: Data structures for representing the syntax tree of all valid Rust source code, including items and expressions.
- `parsing` (enabled by default): Ability to parse input tokens into a syntax tree node of a chosen type.
- `printing` (enabled by default): Ability to print a syntax tree node as tokens of Rust source code.
- `visit`: Trait for traversing a syntax tree.
- `visit-mut`: Trait for traversing and mutating in place a syntax tree.
- `fold`: Trait for transforming an owned syntax tree.
- `clone-impls` (enabled by default): Clone impls for all syntax tree types.
- `extra-traits`: Debug, Eq, PartialEq, Hash impls for all syntax tree types.
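As a sketch of how these features are selected, a crate that needs the full syntax tree and the read-only visitor in addition to the defaults might declare its dependency as follows. The particular combination of features is only an example.

```toml
[dependencies]
# Default features stay enabled; "full" and "visit" are added on top.
syn = { version = "0.12", features = ["full", "visit"] }
quote = "0.4"
```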
Modules§
- buffer
- A stably addressed token buffer supporting efficient traversal based on a cheaply copyable cursor.
- fold
- Syntax tree traversal to transform the nodes of an owned syntax tree.
- punctuated
- A punctuated sequence of syntax tree nodes separated by punctuation.
- spanned
- A trait that can provide the `Span` of the complete contents of a syntax tree node.
- synom
- Parsing interface for parsing a token stream into a syntax tree node.
- token
- Tokens representing Rust punctuation, keywords, and delimiters.
- visit
- Syntax tree traversal to walk a shared borrow of a syntax tree.
- visit_mut
- Syntax tree traversal to mutate an exclusive borrow of a syntax tree in place.
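The three traversal modules differ in how they borrow the tree: `visit` takes a shared borrow, `visit_mut` an exclusive borrow, and `fold` takes ownership. A rough sketch of that distinction on a toy expression type follows. The `Expr` enum and the three functions are hypothetical and far simpler than Syn's generated traits.

```rust
/// Toy syntax tree (hypothetical; not Syn's `Expr`).
#[derive(Debug, Clone, PartialEq)]
pub enum Expr {
    Lit(i64),
    Var(&'static str),
    Add(Box<Expr>, Box<Expr>),
}

/// Visit style: walk a shared borrow and leave the tree untouched.
pub fn count_literals(expr: &Expr) -> usize {
    match expr {
        Expr::Lit(_) => 1,
        Expr::Var(_) => 0,
        Expr::Add(left, right) => count_literals(left) + count_literals(right),
    }
}

/// VisitMut style: walk an exclusive borrow and edit nodes in place.
pub fn double_literals(expr: &mut Expr) {
    match expr {
        Expr::Lit(n) => *n *= 2,
        Expr::Var(_) => {}
        Expr::Add(left, right) => {
            double_literals(left);
            double_literals(right);
        }
    }
}

/// Fold style: consume the owned tree and return a (possibly different) tree.
pub fn fold_constants(expr: Expr) -> Expr {
    match expr {
        Expr::Add(left, right) => match (fold_constants(*left), fold_constants(*right)) {
            // Two literal operands collapse into a single literal node.
            (Expr::Lit(a), Expr::Lit(b)) => Expr::Lit(a + b),
            (l, r) => Expr::Add(Box::new(l), Box::new(r)),
        },
        other => other,
    }
}
```

The fold form is the only one of the three that can change the shape of the tree by replacing a node with one of a different variant, which is why it takes the tree by value.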
Macros§
- Token
- A type-macro that expands to the name of the Rust type representation of a given token.
- alt
- Run a series of parsers, returning the result of the first one which succeeds.
- braces
- Parse inside of `{` `}` curly braces.
- brackets
- Parse inside of `[` `]` square brackets.
- call
- Invoke the given parser function with zero or more arguments.
- cond
- Execute a parser only if a condition is met, otherwise return None.
- cond_reduce
- Execute a parser only if a condition is met, otherwise fail to parse.
- do_parse
- Run a series of parsers, optionally naming each intermediate result, followed by a step to combine the intermediate results.
- epsilon
- Parses nothing and always succeeds.
- input_end
- Parse nothing and succeed only if the end of the enclosing block has been reached.
- keyword
- Parse a single Rust keyword token.
- many0
- Parse zero or more values using the given parser.
- map
- Transform the result of a parser by applying a function or closure.
- named
- Define a parser function with the signature expected by syn parser combinators.
- not
- Invert the result of a parser by parsing successfully if the given parser fails to parse and vice versa.
- option
- Turn a failed parse into `None` and a successful parse into `Some`.
- parens
- Parse inside of `(` `)` parentheses.
- parse_quote
- Quasi-quotation macro that accepts input like the `quote!` macro but uses type inference to figure out a return type for those tokens.
- punct
- Parse a single Rust punctuation token.
- reject
- Unconditionally fail to parse anything.
- switch
- Pattern-match the result of a parser to select which other parser to run.
- syn
- Parse any type that implements the `Synom` trait.
- tuple
- Run a series of parsers and produce all of the results in a tuple.
- value
- Produce the given value without parsing anything.
Structs§
- Abi
- The binary interface of a function: `extern "C"`.
- AngleBracketedGenericArguments
- Angle bracketed arguments of a path segment: the `<K, V>` in `HashMap<K, V>`.
- ArgCaptured
- An explicitly typed pattern captured by a function signature.
- ArgSelf
- Self captured by value in a function signature: `self` or `mut self`.
- ArgSelfRef
- Self captured by reference in a function signature: `&self` or `&mut self`.
- Arm
- One arm of a `match` expression: `0...10 => { return true; }`.
- Attribute
- An attribute like `#[repr(transparent)]`.
- BareFnArg
- An argument in a function type: the `usize` in `fn(usize) -> bool`.
- Binding
- A binding (equality constraint) on an associated type: `Item = u8`.
- Block
- A braced block containing Rust statements.
- BoundLifetimes
- A set of bound lifetimes: `for<'a, 'b, 'c>`.
- ConstParam
- A const generic parameter: `const LENGTH: usize`.
- DataEnum
- An enum input to a `proc_macro_derive` macro.
- DataStruct
- A struct input to a `proc_macro_derive` macro.
- DataUnion
- An untagged union input to a `proc_macro_derive` macro.
- DeriveInput
- Data structure sent to a `proc_macro_derive` macro.
- ExprAddrOf
- A referencing operation: `&a` or `&mut a`.
- ExprArray
- A slice literal expression: `[a, b, c, d]`.
- ExprAssign
- An assignment expression: `a = compute()`.
- ExprAssignOp
- A compound assignment expression: `counter += 1`.
- ExprBinary
- A binary operation: `a + b`, `a * b`.
- ExprBlock
- A blocked scope: `{ ... }`.
- ExprBox
- A box expression: `box f`.
- ExprBreak
- A `break`, with an optional label to break and an optional expression.
- ExprCall
- A function call expression: `invoke(a, b)`.
- ExprCast
- A cast expression: `foo as f64`.
- ExprCatch
- A catch expression: `do catch { ... }`.
- ExprClosure
- A closure expression: `|a, b| a + b`.
- ExprContinue
- A `continue`, with an optional label.
- ExprField
- Access of a named struct field (`obj.k`) or unnamed tuple struct field (`obj.0`).
- ExprForLoop
- A for loop: `for pat in expr { ... }`.
- ExprGroup
- An expression contained within invisible delimiters.
- ExprIf
- An `if` expression with an optional `else` block: `if expr { ... } else { ... }`.
- ExprIfLet
- An `if let` expression with an optional `else` block: `if let pat = expr { ... } else { ... }`.
- ExprInPlace
- A placement expression: `place <- value`.
- ExprIndex
- A square bracketed indexing expression: `vector[2]`.
- ExprLit
- A literal in place of an expression: `1`, `"foo"`.
- ExprLoop
- Conditionless loop: `loop { ... }`.
- ExprMacro
- A macro invocation expression: `format!("{}", q)`.
- ExprMatch
- A `match` expression: `match n { Some(n) => {}, None => {} }`.
- ExprMethodCall
- A method call expression: `x.foo::<T>(a, b)`.
- ExprParen
- A parenthesized expression: `(a + b)`.
- ExprPath
- A path like `std::mem::replace` possibly containing generic parameters and a qualified self-type.
- ExprRange
- A range expression: `1..2`, `1..`, `..2`, `1..=2`, `..=2`.
- ExprRepeat
- An array literal constructed from one repeated element: `[0u8; N]`.
- ExprReturn
- A `return`, with an optional value to be returned.
- ExprStruct
- A struct literal expression: `Point { x: 1, y: 1 }`.
- ExprTry
- A try-expression: `expr?`.
- ExprTuple
- A tuple expression: `(a, b, c, d)`.
- ExprType
- A type ascription expression: `foo: f64`.
- ExprUnary
- A unary operation: `!x`, `*x`.
- ExprUnsafe
- An unsafe block: `unsafe { ... }`.
- ExprVerbatim
- Tokens in expression position not interpreted by Syn.
- ExprWhile
- A while loop: `while expr { ... }`.
- ExprWhileLet
- A while-let loop: `while let pat = expr { ... }`.
- ExprYield
- A yield expression: `yield expr`.
- Field
- A field of a struct or enum variant.
- FieldPat
- A single field in a struct pattern.
- FieldValue
- A field-value pair in a struct literal.
- FieldsNamed
- Named fields of a struct or struct variant such as `Point { x: f64, y: f64 }`.
- FieldsUnnamed
- Unnamed fields of a tuple struct or tuple variant such as `Some(T)`.
- File
- A complete file of Rust source code.
- FnDecl
- Header of a function declaration, without including the body.
- ForeignItemFn
- A foreign function in an `extern` block.
- ForeignItemStatic
- A foreign static item in an `extern` block: `static ext: u8`.
- ForeignItemType
- A foreign type in an `extern` block: `type void`.
- ForeignItemVerbatim
- Tokens in an `extern` block not interpreted by Syn.
- Generics
- Lifetimes and type parameters attached to a declaration of a function, enum, trait, etc.
- Ident
- A word of Rust code, which may be a keyword or legal variable name.
- ImplGenerics
- Returned by `Generics::split_for_impl`.
- ImplItemConst
- An associated constant within an impl block.
- ImplItemMacro
- A macro invocation within an impl block.
- ImplItemMethod
- A method within an impl block.
- ImplItemType
- An associated type within an impl block.
- ImplItemVerbatim
- Tokens within an impl block not interpreted by Syn.
- Index
- The index of an unnamed tuple struct field.
- ItemConst
- A constant item: `const MAX: u16 = 65535`.
- ItemEnum
- An enum definition: `enum Foo<A, B> { C<A>, D<B> }`.
- ItemExternCrate
- An `extern crate` item: `extern crate serde`.
- ItemFn
- A free-standing function: `fn process(n: usize) -> Result<()> { ... }`.
- ItemForeignMod
- A block of foreign items: `extern "C" { ... }`.
- ItemImpl
- An impl block providing trait or associated items: `impl<A> Trait for Data<A> { ... }`.
- ItemMacro
- A macro invocation, which includes `macro_rules!` definitions.
- ItemMacro2
- A 2.0-style declarative macro introduced by the `macro` keyword.
- ItemMod
- A module or module declaration: `mod m` or `mod m { ... }`.
- ItemStatic
- A static item: `static BIKE: Shed = Shed(42)`.
- ItemStruct
- A struct definition: `struct Foo<A> { x: A }`.
- ItemTrait
- A trait definition: `pub trait Iterator { ... }`.
- ItemType
- A type alias: `type Result<T> = std::result::Result<T, MyError>`.
- ItemUnion
- A union definition: `union Foo<A, B> { x: A, y: B }`.
- ItemUse
- A use declaration: `use std::collections::HashMap`.
- ItemVerbatim
- Tokens forming an item not interpreted by Syn.
- Label
- A lifetime labeling a `for`, `while`, or `loop`.
- Lifetime
- A Rust lifetime: `'a`.
- LifetimeDef
- A lifetime definition: `'a: 'b + 'c + 'd`.
- LitBool
- A boolean literal: `true` or `false`.
- LitByte
- A byte literal: `b'f'`.
- LitByteStr
- A byte string literal: `b"foo"`.
- LitChar
- A character literal: `'a'`.
- LitFloat
- A floating point literal: `1f64` or `1.0e10f64`.
- LitInt
- An integer literal: `1` or `1u16`.
- LitStr
- A UTF-8 string literal: `"foo"`.
- LitVerbatim
- A raw token literal not interpreted by Syn, possibly because it represents an integer larger than 64 bits.
- Local
- A local `let` binding: `let x: u64 = s.parse()?`.
- Macro
- A macro invocation: `println!("{}", mac)`.
- MetaList
- A structured list within an attribute, like `derive(Copy, Clone)`.
- MetaNameValue
- A name-value pair within an attribute, like `feature = "nightly"`.
- MethodSig
- A method’s signature in a trait or implementation: `unsafe fn initialize(&self)`.
- MethodTurbofish
- The `::<>` explicit type parameters passed to a method call: `parse::<u64>()`.
- ParenthesizedGenericArguments
- Arguments of a function path segment: the `(A, B) -> C` in `Fn(A,B) -> C`.
- PatBox
- A box pattern: `box v`.
- PatIdent
- A pattern that binds a new variable: `ref mut binding @ SUBPATTERN`.
- PatLit
- A literal pattern: `0`.
- PatMacro
- A macro in pattern position.
- PatPath
- A path pattern like `Color::Red`, optionally qualified with a self-type.
- PatRange
- A range pattern: `1..=2`.
- PatRef
- A reference pattern: `&mut (first, second)`.
- PatSlice
- A dynamically sized slice pattern: `[a, b, i.., y, z]`.
- PatStruct
- A struct or struct variant pattern: `Variant { x, y, .. }`.
- PatTuple
- A tuple pattern: `(a, b)`.
- PatTupleStruct
- A tuple struct or tuple variant pattern: `Variant(x, y, .., z)`.
- PatVerbatim
- Tokens in pattern position not interpreted by Syn.
- PatWild
- A pattern that matches any value: `_`.
- Path
- A path at which a named item is exported: `std::collections::HashMap`.
- PathSegment
- A segment of a path together with any path arguments on that segment.
- PathTokens
- A helper for printing a self-type qualified path as tokens.
- PredicateEq
- An equality predicate in a `where` clause (unsupported).
- PredicateLifetime
- A lifetime predicate in a `where` clause: `'a: 'b + 'c`.
- PredicateType
- A type predicate in a `where` clause: `for<'c> Foo<'c>: Trait<'c>`.
- QSelf
- The explicit Self type in a qualified path: the `T` in `<T as Display>::fmt`.
- TraitBound
- A trait used as a bound on a type parameter.
- TraitItemConst
- An associated constant within the definition of a trait.
- TraitItemMacro
- A macro invocation within the definition of a trait.
- TraitItemMethod
- A trait method within the definition of a trait.
- TraitItemType
- An associated type within the definition of a trait.
- TraitItemVerbatim
- Tokens within the definition of a trait not interpreted by Syn.
- Turbofish
- Returned by `TypeGenerics::as_turbofish`.
- TypeArray
- A fixed size array type: `[T; n]`.
- TypeBareFn
- A bare function type: `fn(usize) -> bool`.
- TypeGenerics
- Returned by `Generics::split_for_impl`.
- TypeGroup
- A type contained within invisible delimiters.
- TypeImplTrait
- An `impl Bound1 + Bound2 + Bound3` type where `Bound` is a trait or a lifetime.
- TypeInfer
- Indication that a type should be inferred by the compiler: `_`.
- TypeMacro
- A macro in the type position.
- TypeNever
- The never type: `!`.
- TypeParam
- A generic type parameter: `T: Into<String>`.
- TypeParen
- A parenthesized type equivalent to the inner type.
- TypePath
- A path like `std::slice::Iter`, optionally qualified with a self-type as in `<Vec<T> as SomeTrait>::Associated`.
- TypePtr
- A raw pointer type: `*const T` or `*mut T`.
- TypeReference
- A reference type: `&'a T` or `&'a mut T`.
- TypeSlice
- A dynamically sized slice type: `[T]`.
- TypeTraitObject
- A trait object type `Bound1 + Bound2 + Bound3` where `Bound` is a trait or a lifetime.
- TypeTuple
- A tuple type: `(A, B, C, String)`.
- TypeVerbatim
- Tokens in type position not interpreted by Syn.
- UseGlob
- A glob import in a `use` item: `*`.
- UseList
- A braced list of imports in a `use` item: `{A, B, C}`.
- UsePath
- An identifier imported by a `use` item: `Type` or `Type as Renamed`.
- Variant
- An enum variant.
- VisCrate
- A crate-level visibility: `pub(crate)`.
- VisPublic
- A public visibility level: `pub`.
- VisRestricted
- A visibility level restricted to some path: `pub(self)` or `pub(super)` or `pub(in some::module)`.
- WhereClause
- A `where` clause in a definition: `where T: Deserialize<'de>, D: 'static`.
Enums§
- AttrStyle
- Distinguishes between attributes that decorate an item and attributes that are contained within an item.
- BareFnArgName
- Name of an argument in a function type: the `n` in `fn(n: usize)`.
- BinOp
- A binary operator: `+`, `+=`, `&`.
- Data
- The storage of a struct, enum or union data structure.
- Expr
- A Rust expression.
- Fields
- Data stored within an enum variant or struct.
- FloatSuffix
- The suffix on a floating point literal if any, like the `f32` in `1.0f32`.
- FnArg
- An argument in a function signature: the `n: usize` in `fn f(n: usize)`.
- ForeignItem
- An item within an `extern` block.
- GenericArgument
- An individual generic argument, like `'a`, `T`, or `Item = T`.
- GenericMethodArgument
- An individual generic argument to a method, like `T`.
- GenericParam
- A generic type parameter, lifetime, or const generic: `T: Into<String>`, `'a: 'b`, `const LEN: usize`.
- ImplItem
- An item within an impl block.
- IntSuffix
- The suffix on an integer literal if any, like the `u8` in `127u8`.
- Item
- Things that can appear directly inside of a module or scope.
- Lit
- A Rust literal such as a string or integer or boolean.
- MacroDelimiter
- A grouping token that surrounds a macro body: `m!(...)` or `m!{...}` or `m![...]`.
- Member
- A struct or tuple struct field accessed in a struct literal or field expression.
- Meta
- Content of a compile-time structured attribute.
- NestedMeta
- Element of a compile-time attribute list.
- Pat
- A pattern in a local binding, function signature, match expression, or various other places.
- PathArguments
- Angle bracketed or parenthesized arguments of a path segment.
- RangeLimits
- Limit types of a range, inclusive or exclusive.
- ReturnType
- Return type of a function signature.
- Stmt
- A statement, usually ending in a semicolon.
- StrStyle
- The style of a string literal, either plain quoted or a raw string like `r##"data"##`.
- TraitBoundModifier
- A modifier on a trait bound, currently only used for the `?` in `?Sized`.
- TraitItem
- An item declaration within the definition of a trait.
- Type
- The possible types that a Rust value could have.
- TypeParamBound
- A trait or lifetime used as a bound on a type parameter.
- UnOp
- A unary operator: `*`, `!`, `-`.
- UseTree
- A suffix of an import tree in a `use` item: `Type as Renamed` or `*`.
- Visibility
- The visibility level of an item: inherited or `pub` or `pub(restricted)`.
- WherePredicate
- A single predicate in a `where` clause: `T: Deserialize<'de>`.
Functions§
- parse
- Parse tokens of source code into the chosen syntax tree node.
- parse2
- Parse a proc-macro2 token stream into the chosen syntax tree node.
- parse_file
- Parse the content of a file of Rust code.
- parse_str
- Parse a string of Rust code into the chosen syntax tree node.