Expand description
Syn is a parsing library for parsing a stream of Rust tokens into a syntax tree of Rust source code.
Currently this library is geared toward use in Rust procedural macros, but contains some APIs that may be useful more generally.
-
Data structures — Syn provides a complete syntax tree that can represent any valid Rust source code. The syntax tree is rooted at
syn::File
which represents a full source file, but there are other entry points that may be useful to procedural macros includingsyn::Item
,syn::Expr
andsyn::Type
. -
Custom derives — Of particular interest to custom derives is
syn::DeriveInput
which is any of the three legal input items to a derive macro. An example below shows using this type in a library that can derive implementations of a trait of your own. -
Parsing — Parsing in Syn is built around parser functions with the signature
fn(ParseStream) -> Result<T>
. Every syntax tree node defined by Syn is individually parsable and may be used as a building block for custom syntaxes, or you may dream up your own brand new syntax without involving any of our syntax tree types. -
Location information — Every token parsed by Syn is associated with a
Span
that tracks line and column information back to the source of that token. These spans allow a procedural macro to display detailed error messages pointing to all the right places in the user’s code. There is an example of this below. -
Feature flags — Functionality is aggressively feature gated so your procedural macros enable only what they need, and do not pay in compile time for all the rest.
Version requirement: Syn supports any compiler version back to Rust’s very first support for procedural macros in Rust 1.15.0. Some features especially around error reporting are only available in newer compilers or on the nightly channel.
Example of a custom derive
The canonical custom derive using Syn looks like this. We write an ordinary
Rust function tagged with a proc_macro_derive
attribute and the name of
the trait we are deriving. Any time that derive appears in the user’s code,
the Rust compiler passes their data structure as tokens into our macro. We
get to execute arbitrary Rust code to figure out what to do with those
tokens, then hand some tokens back to the compiler to compile into the
user’s crate.
[dependencies]
syn = "0.15"
quote = "0.6"
[lib]
proc-macro = true
#[macro_use]
extern crate quote;
#[macro_use]
extern crate syn;
extern crate proc_macro;
use proc_macro::TokenStream;
use syn::DeriveInput;
#[proc_macro_derive(MyMacro)]
pub fn my_macro(input: TokenStream) -> TokenStream {
// Parse the input tokens into a syntax tree
let input = parse_macro_input!(input as DeriveInput);
// Build the output, possibly using quasi-quotation
let expanded = quote! {
// ...
};
// Hand the output tokens back to the compiler
TokenStream::from(expanded)
}
The heapsize
example directory shows a complete working Macros 1.1
implementation of a custom derive. It works on any Rust compiler >=1.15.0.
The example derives a HeapSize
trait which computes an estimate of the
amount of heap memory owned by a value.
pub trait HeapSize {
/// Total number of bytes of heap memory owned by `self`.
fn heap_size_of_children(&self) -> usize;
}
The custom derive allows users to write #[derive(HeapSize)]
on data
structures in their program.
#[derive(HeapSize)]
struct Demo<'a, T: ?Sized> {
a: Box<T>,
b: u8,
c: &'a str,
d: String,
}
Spans and error reporting
The heapsize2
example directory is an extension of the heapsize
example that demonstrates some of the hygiene and error reporting properties
of Macros 2.0. This example currently requires a nightly Rust compiler
>=1.24.0-nightly but we are working to stabilize all of the APIs involved.
The token-based procedural macro API provides great control over where the
compiler’s error messages are displayed in user code. Consider the error the
user sees if one of their field types does not implement HeapSize
.
#[derive(HeapSize)]
struct Broken {
ok: String,
bad: std::thread::Thread,
}
In the Macros 1.1 string-based procedural macro world, the resulting error would point unhelpfully to the invocation of the derive macro and not to the actual problematic field.
error[E0599]: no method named `heap_size_of_children` found for type `std::thread::Thread` in the current scope
--> src/main.rs:4:10
|
4 | #[derive(HeapSize)]
| ^^^^^^^^
By tracking span information all the way through the expansion of a
procedural macro as shown in the heapsize2
example, token-based macros in
Syn are able to trigger errors that directly pinpoint the source of the
problem.
error[E0277]: the trait bound `std::thread::Thread: HeapSize` is not satisfied
--> src/main.rs:7:5
|
7 | bad: std::thread::Thread,
| ^^^^^^^^^^^^^^^^^^^^^^^^ the trait `HeapSize` is not implemented for `Thread`
Parsing a custom syntax
The lazy-static
example directory shows the implementation of a
functionlike!(...)
procedural macro in which the input tokens are parsed
using Syn’s parsing API.
The example reimplements the popular lazy_static
crate from crates.io as a
procedural macro.
lazy_static! {
static ref USERNAME: Regex = Regex::new("^[a-z0-9_-]{3,16}$").unwrap();
}
The implementation shows how to trigger custom warnings and error messages on the macro input.
warning: come on, pick a more creative name
--> src/main.rs:10:16
|
10 | static ref FOO: String = "lazy_static".to_owned();
| ^^^
Debugging
When developing a procedural macro it can be helpful to look at what the
generated code looks like. Use cargo rustc -- -Zunstable-options --pretty=expanded
or the cargo expand
subcommand.
To show the expanded code for some crate that uses your procedural macro,
run cargo expand
from that crate. To show the expanded code for one of
your own test cases, run cargo expand --test the_test_case
where the last
argument is the name of the test file without the .rs
extension.
This write-up by Brandon W Maister discusses debugging in more detail: Debugging Rust’s new Custom Derive system.
Optional features
Syn puts a lot of functionality behind optional features in order to optimize compile time for the most common use cases. The following features are available.
derive
(enabled by default) — Data structures for representing the possible input to a custom derive, including structs and enums and types.full
— Data structures for representing the syntax tree of all valid Rust source code, including items and expressions.parsing
(enabled by default) — Ability to parse input tokens into a syntax tree node of a chosen type.printing
(enabled by default) — Ability to print a syntax tree node as tokens of Rust source code.visit
— Trait for traversing a syntax tree.visit-mut
— Trait for traversing and mutating in place a syntax tree.fold
— Trait for transforming an owned syntax tree.clone-impls
(enabled by default) — Clone impls for all syntax tree types.extra-traits
— Debug, Eq, PartialEq, Hash impls for all syntax tree types.proc-macro
(enabled by default) — Runtime dependency on the dynamic library libproc_macro from rustc toolchain.
Modules
Span
of the complete contents of a syntax
tree node.Macros
quote!
macro but uses
type inference to figure out a return type for those tokens.Structs
extern "C"
.<K, V>
in HashMap<K, V>
.self
or mut self
.&self
or &mut self
.match
expression: 0...10 => { return true; }
.#[repr(transparent)]
.usize
in fn(usize) -> bool
.Item = u8
.for<'a, 'b, 'c>
.const LENGTH: usize
.Iterator<Item: Display>
.proc_macro_derive
macro.proc_macro_derive
macro.proc_macro_derive
macro.proc_macro_derive
macro.[a, b, c, d]
.a = compute()
.counter += 1
.async { ... }
.a + b
, a * b
.{ ... }
.box f
.break
, with an optional label to break and an optional
expression.invoke(a, b)
.foo as f64
.|a, b| a + b
.continue
, with an optional label.obj.k
) or unnamed tuple struct
field (obj.0
).for pat in expr { ... }
.if
expression with an optional else
block: if expr { ... } else { ... }
.place <- value
.vector[2]
.let
guard: let Some(x) = opt
.1
, "foo"
.loop { ... }
.format!("{}", q)
.match
expression: match n { Some(n) => {}, None => {} }
.x.foo::<T>(a, b)
.(a + b)
.std::mem::replace
possibly containing generic
parameters and a qualified self-type.1..2
, 1..
, ..2
, 1..=2
, ..=2
.&a
or &mut a
.[0u8; N]
.return
, with an optional value to be returned.Point { x: 1, y: 1 }
.expr?
.try { ... }
.(a, b, c, d)
.foo: f64
.!x
, *x
.unsafe { ... }
.while expr { ... }
.yield expr
.Point { x: f64, y: f64 }
.Some(T)
.extern
block.extern
block: static ext: u8
.extern
block: type void
.extern
block not interpreted by Syn.Generics::split_for_impl
.const MAX: u16 = 65535
.enum Foo<A, B> { C<A>, D<B> }
.existential type Iter: Iterator<Item = u8>
.extern crate
item: extern crate serde
.fn process(n: usize) -> Result<()> { ... }
.extern "C" { ... }
.impl<A> Trait for Data<A> { ... }
.macro_rules!
definitions.macro
keyword.mod m
or mod m { ... }
.static BIKE: Shed = Shed(42)
.struct Foo<A> { x: A }
.pub trait Iterator { ... }
.pub trait SharableIterator = Iterator + Sync
.type Result<T> = std::result::Result<T, MyError>
.union Foo<A, B> { x: A, y: B }
.use std::collections::HashMap
.for
, while
, or loop
.'a
.'a: 'b + 'c + 'd
.true
or false
.b'f'
.b"foo"
.'a'
.1f64
or 1.0e10f64
.1
or 1u16
."foo"
.let
binding: let x: u64 = s.parse()?
.println!("{}", mac)
.derive(Copy, Clone)
.feature = "nightly"
.unsafe fn initialize(&self)
.::<>
explicit type parameters passed to a method call:
parse::<u64>()
.(A, B) -> C
in Fn(A,B) -> C
.box v
.ref mut binding @ SUBPATTERN
.0
.Color::Red
, optionally qualified with a
self-type.1..=2
.&mut (first, second)
.[a, b, i.., y, z]
.Variant { x, y, .. }
.(a, b)
.Variant(x, y, .., z)
._
.std::collections::HashMap
.where
clause (unsupported).where
clause: 'a: 'b + 'c
.where
clause: for<'c> Foo<'c>: Trait<'c>
.T
in <T as Display>::fmt
.TypeGenerics::as_turbofish
.[T; n]
.fn(usize) -> bool
.Generics::split_for_impl
.impl Bound1 + Bound2 + Bound3
type where Bound
is a trait or
a lifetime._
.!
.T: Into<String>
.std::slice::Iter
, optionally qualified with a
self-type as in <Vec<T> as SomeTrait>::Associated
.*const T
or *mut T
.&'a T
or &'a mut T
.[T]
.Bound1 + Bound2 + Bound3
where Bound
is a
trait or a lifetime.(A, B, C, String)
.use
item: *
.use
item: {A, B, C}
.use
item: HashMap
.use
item: std::...
.use
item: HashMap as Map
.crate
.pub
.pub(self)
or
pub(super)
or pub(crate)
or pub(in some::module)
.where
clause in a definition: where T: Deserialize<'de>, D: 'static
.Enums
n
in fn(n: usize)
.+
, +=
, &
.f32
in
1.0f32
.n: usize
in fn f(n: usize)
.extern
block.'a
, T
, or Item = T
.T
.T: Into<String>
,
'a: 'b
, const LEN: usize
.u8
in 127u8
.m!(...)
or m!{...}
or m![...]
.r##"data"##
.?
in
?Sized
.*
, !
, -
.use
item: Type as Renamed
or *
.pub
or
pub(restricted)
.where
clause: T: Deserialize<'de>
.