Struct charabia::Tokenizer

pub struct Tokenizer<'al, 'sw, A> { /* private fields */ }

Structure used to tokenize a text with custom configurations.

See TokenizerBuilder for how to build a Tokenizer.

Implementations

Creates an Iterator over Tokens.

The provided text is first segmented into tokens; the tokens are then normalized and classified.
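The three stages described above (segment, normalize, classify) can be pictured with a small standard-library sketch. Everything in it is hypothetical: ToyToken, Kind, toy_segment and toy_tokenize are illustrative names, the segmentation rule (runs of alphanumeric versus other characters) and the normalization (lowercasing) are assumptions, and none of it is charabia's actual implementation.

```rust
// Toy illustration only: these types and functions are hypothetical
// and are not part of charabia.
#[derive(Debug, PartialEq)]
enum Kind {
    Word,
    Separator,
}

#[derive(Debug, PartialEq)]
struct ToyToken {
    lemma: String,
    kind: Kind,
}

/// Stage 1: segment the text into maximal runs of alphanumeric
/// characters and maximal runs of everything else.
fn toy_segment(text: &str) -> Vec<&str> {
    let mut pieces = Vec::new();
    let mut start = 0;
    let mut prev: Option<bool> = None;
    for (i, c) in text.char_indices() {
        let is_word = c.is_alphanumeric();
        if let Some(p) = prev {
            if p != is_word {
                // The character class changed, so close the current run.
                pieces.push(&text[start..i]);
                start = i;
            }
        }
        prev = Some(is_word);
    }
    if start < text.len() {
        pieces.push(&text[start..]);
    }
    pieces
}

/// Stages 2 and 3: normalize each piece (here: lowercase it) and
/// classify it as a word or a separator.
fn toy_tokenize(text: &str) -> impl Iterator<Item = ToyToken> + '_ {
    toy_segment(text).into_iter().map(|piece| ToyToken {
        lemma: piece.to_lowercase(),
        kind: if piece.chars().all(char::is_alphanumeric) {
            Kind::Word
        } else {
            Kind::Separator
        },
    })
}
```

Under these assumptions, toy_tokenize("Hello, World!") yields four tokens: the words "hello" and "world", each followed by a separator.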

Same as tokenize, but pairs each Token with its corresponding portion of the original text.
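To show why pairing tokens with their original slices is useful, here is a minimal standard-library sketch. SpannedToken and toy_reconstruct are hypothetical names, and splitting on spaces is an assumption made for brevity; this is not how charabia segments text. The point it illustrates is that the normalized form may differ from the source while the original slices still concatenate back into the input.

```rust
// Hypothetical toy type, not charabia's Token.
#[derive(Debug, PartialEq)]
struct SpannedToken {
    lemma: String,     // normalized form (trimmed and lowercased here)
    byte_start: usize, // start offset in the original text
    byte_end: usize,   // end offset (exclusive) in the original text
}

/// Pair every toy token with the exact slice of `text` it came from.
fn toy_reconstruct(text: &str) -> Vec<(&str, SpannedToken)> {
    let mut out = Vec::new();
    let mut start = 0;
    // `split_inclusive` keeps the delimiter, so no byte of the input is lost.
    for piece in text.split_inclusive(' ') {
        let end = start + piece.len();
        out.push((
            &text[start..end],
            SpannedToken {
                lemma: piece.trim_end().to_lowercase(),
                byte_start: start,
                byte_end: end,
            },
        ));
        start = end;
    }
    out
}
```

Because each pair carries the untouched slice, a caller can highlight or crop matches in the original text even though matching is done on the normalized lemma.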

Segments the provided text, creating an Iterator over Tokens.

Segments the provided text, creating an Iterator over &str.

Auto Trait Implementations

Blanket Implementations

Gets the TypeId of self.
Immutably borrows from an owned value.
Mutably borrows from an owned value.

Returns the argument unchanged.

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.
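The two conversion entries above come from the standard library's blanket implementations and are not specific to Tokenizer. A minimal sketch with a hypothetical Meters type shows both: From<T> for T returns its argument unchanged, and Into<U> for T simply forwards to U::from.

```rust
// `Meters` is a hypothetical type used only for illustration.
#[derive(Debug, PartialEq)]
struct Meters(f64);

impl From<f64> for Meters {
    fn from(value: f64) -> Self {
        Meters(value)
    }
}

/// `Into<Meters> for f64` is provided by the blanket impl and
/// calls `Meters::from(value)`.
fn to_meters(value: f64) -> Meters {
    value.into()
}

/// `From<T> for T` is the reflexive conversion: the argument is
/// returned unchanged.
fn identity(value: String) -> String {
    String::from(value)
}
```

Only the From implementation is written by hand; the Into direction is obtained for free.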

The type returned in the event of a conversion error (TryFrom).
Performs the conversion (TryFrom).
The type returned in the event of a conversion error (TryInto).
Performs the conversion (TryInto).