pub struct StandardTokenizer;Expand description
Unicode Text Segmentation tokenizer (UAX#29 word boundaries).
The default tokenizer for standard and stop analyzers. Splits text
on Unicode word boundaries, keeping only “word” segments (skipping
whitespace and punctuation).
See [[analyzers#Tokenizer]] and UAX#29.
Trait Implementations§
Auto Trait Implementations§
impl Freeze for StandardTokenizer
impl RefUnwindSafe for StandardTokenizer
impl Send for StandardTokenizer
impl Sync for StandardTokenizer
impl Unpin for StandardTokenizer
impl UnsafeUnpin for StandardTokenizer
impl UnwindSafe for StandardTokenizer
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more