Tokenizer

Trait Tokenizer 

pub trait Tokenizer:
    'static
    + Clone
    + Send
    + Sync {
    type TokenStream<'a>: TokenStream;

    // Required method
    fn token_stream<'a>(&'a mut self, text: &'a str) -> Self::TokenStream<'a>;
}

A Tokenizer splits text into a token stream.

Required Associated Types

type TokenStream<'a>: TokenStream

Stream type emitted by the tokenizer.

Required Methods

fn token_stream<'a>(&'a mut self, text: &'a str) -> Self::TokenStream<'a>

Create a token stream for the provided text.
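
As a rough illustration of how the required method and the associated type fit together, here is a minimal sketch of a custom implementation. It is not taken from the crate: `Token` and `TokenStream` are simplified local stand-ins so the block compiles on its own, and `CommaTokenizer`, `CommaTokenStream`, and `collect_texts` are hypothetical names. Only the `Tokenizer` trait itself is copied from the declaration above. The sketch also suggests one plausible use of the `&'a mut self` receiver: the stream can borrow a reusable `Token` buffer owned by the tokenizer.

```rust
// Simplified local stand-ins (assumptions) so the sketch compiles without
// the crate. The real `Token` and `TokenStream` expose more than this.
#[derive(Clone, Debug, Default)]
pub struct Token {
    pub offset_from: usize,
    pub offset_to: usize,
    pub position: usize,
    pub text: String,
}

pub trait TokenStream {
    /// Moves to the next token; returns `false` once the text is exhausted.
    fn advance(&mut self) -> bool;
    /// Returns the token produced by the last successful `advance`.
    fn token(&self) -> &Token;
}

// Same shape as the trait documented on this page.
pub trait Tokenizer: 'static + Clone + Send + Sync {
    type TokenStream<'a>: TokenStream;
    fn token_stream<'a>(&'a mut self, text: &'a str) -> Self::TokenStream<'a>;
}

/// Hypothetical tokenizer that splits on commas and skips empty segments.
#[derive(Clone, Default)]
pub struct CommaTokenizer {
    token: Token, // buffer reused across calls to `token_stream`
}

pub struct CommaTokenStream<'a> {
    text: &'a str,
    cursor: usize,        // byte offset of the next unread segment
    next_position: usize, // index of the next token to emit
    token: &'a mut Token, // borrowed from the tokenizer for 'a
}

impl Tokenizer for CommaTokenizer {
    type TokenStream<'a> = CommaTokenStream<'a>;

    fn token_stream<'a>(&'a mut self, text: &'a str) -> Self::TokenStream<'a> {
        CommaTokenStream { text, cursor: 0, next_position: 0, token: &mut self.token }
    }
}

impl TokenStream for CommaTokenStream<'_> {
    fn advance(&mut self) -> bool {
        while self.cursor < self.text.len() {
            let start = self.cursor;
            let end = match self.text[start..].find(',') {
                Some(i) => start + i,
                None => self.text.len(),
            };
            self.cursor = end + 1; // step past the segment and its comma
            if end > start {
                self.token.text.clear();
                self.token.text.push_str(&self.text[start..end]);
                self.token.offset_from = start;
                self.token.offset_to = end;
                self.token.position = self.next_position;
                self.next_position += 1;
                return true;
            }
        }
        false
    }

    fn token(&self) -> &Token {
        &*self.token
    }
}

/// Drains a stream and collects the token texts.
pub fn collect_texts<T: Tokenizer>(tokenizer: &mut T, text: &str) -> Vec<String> {
    let mut stream = tokenizer.token_stream(text);
    let mut texts = Vec::new();
    while stream.advance() {
        texts.push(stream.token().text.clone());
    }
    texts
}
```

With these definitions, calling `collect_texts` on a `CommaTokenizer` with the input `"a,b,,c"` collects the three texts `a`, `b`, and `c`. The same tokenizer value can be reused for further inputs, since each stream releases its borrow when dropped.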

Dyn Compatibility

This trait is not dyn compatible.

In older versions of Rust, dyn compatibility was called "object safety", so this trait is not object safe.
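
Two features of the declaration rule out `dyn Tokenizer`: the `Clone` supertrait requires `Sized`, and the generic associated type `TokenStream<'a>` cannot be named through a trait object. A common workaround is a dyn-compatible companion trait that boxes the stream and exposes cloning as a method. The sketch below shows that pattern under stated assumptions: `DynTokenizer`, `WholeText`, `WholeTextStream`, and `first_token` are hypothetical names, and `TokenStream` is a simplified stand-in. The crate's own `BoxableTokenizer` and `BoxTokenStream`, listed under the foreign-type implementations, presumably serve a similar purpose, but their exact signatures are not shown on this page.

```rust
// Simplified stand-in (assumption) so the sketch compiles on its own.
pub trait TokenStream {
    fn advance(&mut self) -> bool;
    fn text(&self) -> &str;
}

// Same shape as the trait documented on this page.
pub trait Tokenizer: 'static + Clone + Send + Sync {
    type TokenStream<'a>: TokenStream;
    fn token_stream<'a>(&'a mut self, text: &'a str) -> Self::TokenStream<'a>;
}

// Hypothetical dyn-compatible companion: no `Clone` supertrait, no generic
// associated type. The stream is boxed and cloning returns a boxed copy.
pub trait DynTokenizer: 'static + Send + Sync {
    fn dyn_token_stream<'a>(&'a mut self, text: &'a str) -> Box<dyn TokenStream + 'a>;
    fn dyn_clone(&self) -> Box<dyn DynTokenizer>;
}

// Every concrete tokenizer gets the type-erased interface automatically.
impl<T: Tokenizer> DynTokenizer for T {
    fn dyn_token_stream<'a>(&'a mut self, text: &'a str) -> Box<dyn TokenStream + 'a> {
        Box::new(self.token_stream(text))
    }

    fn dyn_clone(&self) -> Box<dyn DynTokenizer> {
        Box::new(self.clone())
    }
}

/// Hypothetical tokenizer that emits the whole input as a single token.
#[derive(Clone, Default)]
pub struct WholeText;

pub struct WholeTextStream<'a> {
    text: &'a str,
    emitted: bool,
}

impl TokenStream for WholeTextStream<'_> {
    fn advance(&mut self) -> bool {
        // Succeeds exactly once per stream.
        let first = !self.emitted;
        self.emitted = true;
        first
    }

    fn text(&self) -> &str {
        self.text
    }
}

impl Tokenizer for WholeText {
    type TokenStream<'a> = WholeTextStream<'a>;

    fn token_stream<'a>(&'a mut self, text: &'a str) -> Self::TokenStream<'a> {
        WholeTextStream { text, emitted: false }
    }
}

/// Works with any tokenizer behind a trait object.
pub fn first_token(tokenizer: &mut dyn DynTokenizer, text: &str) -> Option<String> {
    let mut stream = tokenizer.dyn_token_stream(text);
    let result = if stream.advance() {
        Some(stream.text().to_string())
    } else {
        None
    };
    result
}
```

The cost of this pattern is one heap allocation per stream and dynamic dispatch on every `advance` call, which is why generic code would normally prefer the statically typed `Tokenizer` trait where the concrete type is known.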

Implementations on Foreign Types

impl Tokenizer for Box<dyn BoxableTokenizer>

type TokenStream<'a> = BoxTokenStream<'a>

fn token_stream<'a>(&'a mut self, text: &'a str) -> Self::TokenStream<'a>

Implementors

impl Tokenizer for NgramTokenizer

type TokenStream<'a> = NgramTokenStream<'a>

impl Tokenizer for RawTokenizer

type TokenStream<'a> = RawTokenStream<'a>

impl Tokenizer for SimpleTokenizer

impl Tokenizer for WhitespaceTokenizer

type TokenStream<'a> = WhitespaceTokenStream<'a>