Struct tantivy::tokenizer::TokenizerManager
[−]
[src]
pub struct TokenizerManager { /* fields omitted */ }
The tokenizer manager serves as a store for all of the pre-configured tokenizer pipelines.
By default, it is populated with the following managers.
raw
: does not process nor tokenize the text.default
: Chops the text on according to whitespace and punctuation, removes tokens that are too long, and lowercases tokensen_stem
: Likedefault
, but also applies stemming on the resulting tokens. Stemming can improve the recall of your search engine.
Methods
impl TokenizerManager
[src]
pub fn register<A>(&self, tokenizer_name: &str, tokenizer: A) where
A: 'static + Send + Sync + for<'a> Tokenizer<'a>,
[src]
A: 'static + Send + Sync + for<'a> Tokenizer<'a>,
Registers a new tokenizer associated with a given name.
pub fn get(&self, tokenizer_name: &str) -> Option<Box<BoxedTokenizer>>
[src]
Accessing a tokenizer given its name.
Trait Implementations
impl Clone for TokenizerManager
[src]
fn clone(&self) -> TokenizerManager
[src]
Returns a copy of the value. Read more
fn clone_from(&mut self, source: &Self)
1.0.0[src]
Performs copy-assignment from source
. Read more
impl Default for TokenizerManager
[src]
fn default() -> TokenizerManager
[src]
Creates an TokenizerManager
prepopulated with
the default pre-configured tokenizers of tantivy
.
- simple
- en_stem
- ja