Struct tantivy::tokenizer::TokenizerManager

pub struct TokenizerManager { /* fields omitted */ }

The tokenizer manager serves as a store for all of the pre-configured tokenizer pipelines.

By default, it is populated with the following tokenizers:

  • raw : Neither processes nor tokenizes the text.
  • default : Chops the text on whitespace and punctuation, removes tokens that are too long, and lowercases the remaining tokens.
  • en_stem : Like default, but also applies stemming to the resulting tokens. Stemming can improve the recall of your search engine.
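
To make the difference between raw and default concrete, here is a minimal, standard-library-only sketch of the behaviour described in the list above. It does not use tantivy's own tokenizer types: the function names `raw_tokens` and `default_tokens` are hypothetical, and the `MAX_TOKEN_LEN` value is an assumption chosen for illustration, since the text above only says that overly long tokens are removed. Stemming (en_stem) is left out because it needs a stemming library.

```rust
/// Hypothetical stand-in for the `raw` pipeline:
/// the input is kept as a single, unmodified token.
fn raw_tokens(text: &str) -> Vec<String> {
    vec![text.to_string()]
}

/// Assumed length limit (in bytes), for illustration only.
const MAX_TOKEN_LEN: usize = 40;

/// Hypothetical stand-in for the `default` pipeline.
fn default_tokens(text: &str) -> Vec<String> {
    text.split(|c: char| !c.is_alphanumeric()) // chop on whitespace and punctuation
        .filter(|t| !t.is_empty())             // drop empty pieces between separators
        .filter(|t| t.len() <= MAX_TOKEN_LEN)  // remove tokens that are too long
        .map(|t| t.to_lowercase())             // lowercase the remaining tokens
        .collect()
}
```

Under these assumptions, `raw_tokens("Hello, World!")` yields the single token `Hello, World!`, while `default_tokens("Hello, World!")` yields `hello` and `world`.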

Methods

impl TokenizerManager

Registers a new tokenizer associated with a given name.

Accesses a tokenizer given its name.
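
The two methods above amount to a name-keyed store: one call inserts a tokenizer under a name, the other looks it up. The following is a rough model of that idea using only the standard library. It is not tantivy's implementation or API: `SketchManager`, `TokenizerFn`, and the method names and signatures used here are all hypothetical, and a tokenizer is simplified to a plain function from text to tokens.

```rust
use std::collections::HashMap;
use std::sync::{Arc, RwLock};

/// Hypothetical tokenizer type for this sketch: a function from text to tokens.
type TokenizerFn = Arc<dyn Fn(&str) -> Vec<String> + Send + Sync>;

/// Hypothetical model of a name-to-tokenizer store; not tantivy's actual type.
#[derive(Clone, Default)]
struct SketchManager {
    tokenizers: Arc<RwLock<HashMap<String, TokenizerFn>>>,
}

impl SketchManager {
    /// Registers a tokenizer under `name`, replacing any previous entry.
    fn register<F>(&self, name: &str, tokenizer: F)
    where
        F: Fn(&str) -> Vec<String> + Send + Sync + 'static,
    {
        self.tokenizers
            .write()
            .expect("lock poisoned")
            .insert(name.to_string(), Arc::new(tokenizer));
    }

    /// Looks a tokenizer up by name; `None` if nothing is registered under it.
    fn get(&self, name: &str) -> Option<TokenizerFn> {
        self.tokenizers
            .read()
            .expect("lock poisoned")
            .get(name)
            .cloned()
    }
}
```

In this model, looking up a name that was never registered returns `None` rather than panicking, so callers can decide how to handle a missing tokenizer.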

Trait Implementations

impl Clone for TokenizerManager

Returns a copy of the value.

Performs copy-assignment from source.

impl Default for TokenizerManager

Creates a TokenizerManager prepopulated with the default pre-configured tokenizers of tantivy.

  • raw
  • default
  • en_stem

Auto Trait Implementations