pub struct LanguageAwareTokenizer<F>{ /* private fields */ }Expand description
Language-aware tokenizer that can be configured per-field
This allows selecting the stemmer language based on document metadata, such as a “language” field in the document.
Implementations§
Source§impl<F> LanguageAwareTokenizer<F>
impl<F> LanguageAwareTokenizer<F>
Sourcepub fn new(language_selector: F) -> Self
pub fn new(language_selector: F) -> Self
Create a new language-aware tokenizer with a custom language selector
The selector function receives a language hint (e.g., from a document field) and returns the appropriate Language to use for stemming.
§Example
ⓘ
let tokenizer = LanguageAwareTokenizer::new(|hint| {
match hint {
"en" | "english" => Language::English,
"de" | "german" => Language::German,
"ru" | "russian" => Language::Russian,
_ => Language::English,
}
});Trait Implementations§
Source§impl<F> Clone for LanguageAwareTokenizer<F>
impl<F> Clone for LanguageAwareTokenizer<F>
Source§fn clone(&self) -> LanguageAwareTokenizer<F>
fn clone(&self) -> LanguageAwareTokenizer<F>
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreAuto Trait Implementations§
impl<F> Freeze for LanguageAwareTokenizer<F>where
F: Freeze,
impl<F> RefUnwindSafe for LanguageAwareTokenizer<F>where
F: RefUnwindSafe,
impl<F> Send for LanguageAwareTokenizer<F>
impl<F> Sync for LanguageAwareTokenizer<F>
impl<F> Unpin for LanguageAwareTokenizer<F>where
F: Unpin,
impl<F> UnwindSafe for LanguageAwareTokenizer<F>where
F: UnwindSafe,
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more