pub struct LanguageAwareTokenizer<F>{ /* private fields */ }Expand description
Language-aware tokenizer that can be configured per-field
This allows selecting the stemmer language based on document metadata, such as a “language” field in the document.
Implementations§
Source§impl<F> LanguageAwareTokenizer<F>
impl<F> LanguageAwareTokenizer<F>
Sourcepub fn new(language_selector: F) -> Self
pub fn new(language_selector: F) -> Self
Create a new language-aware tokenizer with a custom language selector
The selector function receives a language hint (e.g., from a document field) and returns the appropriate Language to use for stemming.
§Example
ⓘ
let tokenizer = LanguageAwareTokenizer::new(|hint| {
match hint {
"en" | "english" => Language::English,
"de" | "german" => Language::German,
"ru" | "russian" => Language::Russian,
_ => Language::English,
}
});Trait Implementations§
Source§impl<F> Clone for LanguageAwareTokenizer<F>
impl<F> Clone for LanguageAwareTokenizer<F>
Source§fn clone(&self) -> LanguageAwareTokenizer<F>
fn clone(&self) -> LanguageAwareTokenizer<F>
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreAuto Trait Implementations§
impl<F> Freeze for LanguageAwareTokenizer<F>where
F: Freeze,
impl<F> RefUnwindSafe for LanguageAwareTokenizer<F>where
F: RefUnwindSafe,
impl<F> Send for LanguageAwareTokenizer<F>
impl<F> Sync for LanguageAwareTokenizer<F>
impl<F> Unpin for LanguageAwareTokenizer<F>where
F: Unpin,
impl<F> UnwindSafe for LanguageAwareTokenizer<F>where
F: UnwindSafe,
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§impl<T> Pointable for T
impl<T> Pointable for T
Source§impl<SS, SP> SupersetOf<SS> for SPwhere
SS: SubsetOf<SP>,
impl<SS, SP> SupersetOf<SS> for SPwhere
SS: SubsetOf<SP>,
Source§fn to_subset(&self) -> Option<SS>
fn to_subset(&self) -> Option<SS>
The inverse inclusion map: attempts to construct
self from the equivalent element of its
superset. Read moreSource§fn is_in_subset(&self) -> bool
fn is_in_subset(&self) -> bool
Checks if
self is actually part of its subset T (and can be converted to it).Source§fn to_subset_unchecked(&self) -> SS
fn to_subset_unchecked(&self) -> SS
Use with care! Same as
self.to_subset but without any property checks. Always succeeds.Source§fn from_subset(element: &SS) -> SP
fn from_subset(element: &SS) -> SP
The inclusion map: converts
self to the equivalent element of its superset.