Skip to main content

Tokenizer

Struct Tokenizer 

Source
pub struct Tokenizer { /* private fields */ }
Expand description

Configurable tokenizer with custom operators, keywords, and middleware.

Built-in operators and keywords are always available. Custom registrations take priority over built-ins, checked longest-pattern-first.

let tok = Tokenizer::new()
    .op("=~", Token::Custom("RegexMatch".into()))
    .keyword("where", Token::Custom("Where".into()))
    .transform(|tokens| { /* rewrite and return */ tokens });
let tokens = tok.tokenize("name =~ 'foo.*'")?;

Implementations§

Source§

impl Tokenizer

Source

pub fn new() -> Self

Create a new tokenizer with only built-in operators and keywords.

Source

pub fn op(self, pattern: &str, token: Token) -> Self

Register a custom operator pattern.

The pattern is matched byte-for-byte against the input. Custom ops are checked before built-in ops, longest pattern first.

Source

pub fn keyword(self, word: &str, token: Token) -> Self

Register a custom keyword.

Keywords are matched case-insensitively against identifier tokens. Custom keywords override built-in keywords with the same name.

Source

pub fn transform(self, f: fn(Vec<Token>) -> Vec<Token>) -> Self

Register a transform (middleware) that runs after tokenization.

Transforms run in registration order. Each receives the token stream and returns a new one. Use fn pointers for zero overhead.

Source

pub fn tokenize(&self, input: &str) -> Result<Vec<Token>, String>

Tokenize input using this configuration.

Trait Implementations§

Source§

impl Default for Tokenizer

Source§

fn default() -> Self

Returns the “default value” for a type. Read more

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.