§Parsey
parsey is a lightweight framework for creating custom parsers and abstract syntax trees (ASTs).
It provides two key traits: Parser and Ast, which together form the foundation
for building parsers and representing the structure of parsed data.
§Key Features
- Generic Parsing Framework: Abstracts the process of parsing tokens into structured data.
- Customizable AST Nodes: Easily define nodes of your AST by implementing the
Ast trait.
§Getting Started
Let’s implement a simple parser that parses a stream of zero and one tokens into groups of two bits!
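Before wiring anything into parsey, it may help to see the target transformation on its own. The following is a minimal sketch in plain Rust with no parsey types; `Bit`, `Pair`, and `group_pairs` are hypothetical names used only for illustration, and rejecting odd-length input with `None` is an assumption about how a dangling bit should be handled.

```rust
// Standalone sketch: nothing here comes from parsey.

#[derive(Debug, Clone, Copy, PartialEq)]
pub enum Bit {
    Zero,
    One,
}

#[derive(Debug, PartialEq)]
pub enum Pair {
    ZeroZero,
    ZeroOne,
    OneZero,
    OneOne,
}

/// Groups `bits` into pairs, in order.
/// Returns `None` when the input has an odd number of bits.
pub fn group_pairs(bits: &[Bit]) -> Option<Vec<Pair>> {
    if bits.len() % 2 != 0 {
        return None;
    }
    Some(
        bits.chunks_exact(2)
            .map(|chunk| match (chunk[0], chunk[1]) {
                (Bit::Zero, Bit::Zero) => Pair::ZeroZero,
                (Bit::Zero, Bit::One) => Pair::ZeroOne,
                (Bit::One, Bit::Zero) => Pair::OneZero,
                (Bit::One, Bit::One) => Pair::OneOne,
            })
            .collect(),
    )
}
```

The steps below build the same mapping with the Parser and Ast traits, where the token stream replaces the slice.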
§Step 1: Implement the Parser Trait
Define a struct that will serve as your parser. This struct must implement the Parser trait,
which iterates over tokens and produces an AST.
use parsey::{parse, require_next_n, Ast, Parser, TokenStream};

#[derive(Debug, PartialEq)]
pub enum MyToken {
    Zero,
    One,
}

#[derive(Debug, PartialEq)]
pub struct MyError;

pub struct MyParser {
    tokens: Vec<MyToken>,
}

impl Parser<MyToken, MyError> for MyParser {
    type Root = Root;

    // Consume the next token only if it matches `token`.
    fn expect(
        token_stream: &mut TokenStream<Self, MyToken, MyError>,
        token: MyToken,
    ) -> Result<(), MyError> {
        if token_stream.peek() == Some(&token) {
            token_stream.next();
            Ok(())
        } else {
            Err(MyError)
        }
    }
}

impl Iterator for MyParser {
    type Item = MyToken;

    fn next(&mut self) -> Option<Self::Item> {
        self.tokens.pop()
    }
}

impl From<Vec<MyToken>> for MyParser {
    fn from(mut value: Vec<MyToken>) -> Self {
        // Reverse once so that `pop()` in `next` yields tokens in their original order.
        value.reverse();
        Self { tokens: value }
    }
}
§Step 2: Define the AST Nodes
Create the structure for your AST by implementing the Ast trait for each node.
The root node must match the type defined in Parser::Root.
#[derive(Debug, PartialEq)]
pub struct Root(Vec<TwoBit>);

#[derive(Debug, PartialEq)]
pub enum TwoBit {
    ZeroZero,
    ZeroOne,
    OneZero,
    OneOne,
}

impl Ast<MyToken, MyError> for Root {
    fn parse<P>(token_stream: &mut TokenStream<P, MyToken, MyError>) -> Result<Self, MyError>
    where
        P: Parser<MyToken, MyError>,
    {
        // Keep parsing two-bit groups until the stream is exhausted.
        let mut two_bits = vec![];
        while !token_stream.is_empty() {
            two_bits.push(TwoBit::parse(token_stream)?);
        }
        Ok(Self(two_bits))
    }
}

impl Ast<MyToken, MyError> for TwoBit {
    fn parse<P>(token_stream: &mut TokenStream<P, MyToken, MyError>) -> Result<Self, MyError>
    where
        P: parsey::Parser<MyToken, MyError>,
    {
        use MyToken::*;
        use TwoBit::*;

        // Take the next two tokens, or return `MyError` if the stream ends first.
        match require_next_n!(token_stream, 2, MyError) {
            [Zero, Zero] => Ok(ZeroZero),
            [Zero, One] => Ok(ZeroOne),
            [One, Zero] => Ok(OneZero),
            [One, One] => Ok(OneOne),
        }
    }
}
§Step 3: Parse Tokens
Use your parser to parse a sequence of tokens into an AST.
use MyToken::{One, Zero};
use TwoBit::{OneOne, OneZero, ZeroOne, ZeroZero};
let tokens = vec![Zero, Zero, Zero, One, One, Zero, One, One];
let ast = parse::<MyParser, MyToken, MyError>(tokens);
assert_eq!(ast, Ok(Root(vec![ZeroZero, ZeroOne, OneZero, OneOne])));
Macros§
- next_n
- Get the next n tokens from token_stream.
- peek_n
- Peek at the next n tokens from token_stream without consuming them.
- require_next_n
- Get the next n tokens from token_stream, or return the provided error if the token stream ends before the required number of tokens is consumed.
- require_peek_n
- Peek at the next n tokens from token_stream, or return the provided error if the token stream ends before the required number of tokens is peeked.
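The expansion of these macros is not shown on this page. As a rough mental model only, "get the next n tokens or return the provided error" can be sketched with the standard library alone. `take_exact` below is a hypothetical helper, not a parsey API, and parsey's macros may differ in detail, for example in what happens to tokens already consumed when the stream runs out.

```rust
use std::convert::TryFrom;

/// Hypothetical helper (not part of parsey): take exactly `N` items from
/// `iter`, or return `err` if the iterator ends first.
/// Items pulled before the iterator ran out are dropped in the error case.
pub fn take_exact<I, T, E, const N: usize>(iter: &mut I, err: E) -> Result<[T; N], E>
where
    I: Iterator<Item = T>,
{
    let mut buf = Vec::with_capacity(N);
    for _ in 0..N {
        match iter.next() {
            Some(item) => buf.push(item),
            None => return Err(err),
        }
    }
    // `buf` holds exactly `N` items here, so the conversion is expected to succeed.
    match <[T; N]>::try_from(buf) {
        Ok(array) => Ok(array),
        Err(_) => Err(err),
    }
}
```

Annotating the result type, as in `let pair: Result<[u8; 2], &str> = take_exact(&mut iter, "too short");`, fixes `N` at 2. Returning a fixed-size array is what allows an exhaustive `match` over array patterns like the one in Step 2.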
Structs§
- TokenStream
- A wrapper around a peekable parser that provides lookahead functionality.
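To illustrate what lookahead buys a parser, here is a small sketch built on `std::iter::Peekable` rather than on parsey's own wrapper. `expect_item` is a hypothetical helper that follows the same peek-then-consume shape as the `expect` method in Step 1; how parsey implements its lookahead internally is not shown on this page.

```rust
use std::iter::Peekable;

/// Hypothetical helper (not part of parsey): consume the next item only if
/// it equals `expected`. Returns whether an item was consumed.
pub fn expect_item<I>(iter: &mut Peekable<I>, expected: &I::Item) -> bool
where
    I: Iterator,
    I::Item: PartialEq,
{
    // `peek` inspects the next item without advancing the iterator.
    if iter.peek() == Some(expected) {
        iter.next();
        true
    } else {
        false
    }
}
```

Because a failed check leaves the iterator untouched, the caller is free to try a different alternative on the same token.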
Traits§
- Ast
- A trait representing a component of an abstract syntax tree (AST).
- Parser
- A trait representing a generic parser that consumes tokens and produces an AST.
Functions§
- parse
- Parse a vec of tokens into the provided root AST node.