pub struct UnicodeSegmenter;
Unicode segmentation-based tokenizer (fallback for CJK and other languages).
Uses Unicode Standard Annex #29 (UAX #29) word boundaries. This is a reasonable fallback, but language-specific tokenizers are preferred.
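To give a rough feel for what boundary-based tokenization does, here is a minimal std-only sketch. It is not the implementation behind `UnicodeSegmenter` and it is not a full UAX #29 implementation: the names `approximate_words` and `is_cjk_ideograph` are hypothetical, only two CJK ideograph blocks are recognized, and rules such as keeping `can't` together are omitted. It groups runs of alphanumeric characters into words and emits each ideograph as its own token, which mirrors how the default UAX #29 rules do not group ideographs into words without dictionary-based tailoring.

```rust
/// Hypothetical helper: true for characters in two common CJK ideograph
/// blocks. A real implementation would consult Unicode property tables.
fn is_cjk_ideograph(c: char) -> bool {
    matches!(
        c as u32,
        0x4E00..=0x9FFF      // CJK Unified Ideographs
            | 0x3400..=0x4DBF // CJK Unified Ideographs Extension A
    )
}

/// Simplified approximation of word segmentation (assumption: only words
/// are returned; whitespace and punctuation are dropped).
pub fn approximate_words(text: &str) -> Vec<&str> {
    let mut words = Vec::new();
    // Byte offset where the current alphanumeric run began, if any.
    let mut start: Option<usize> = None;

    for (i, c) in text.char_indices() {
        let ideograph = is_cjk_ideograph(c);
        if c.is_alphanumeric() && !ideograph {
            // Extend (or begin) the current run.
            if start.is_none() {
                start = Some(i);
            }
        } else {
            // Any other character ends the current run.
            if let Some(s) = start.take() {
                words.push(&text[s..i]);
            }
            // Each ideograph becomes a single-character token.
            if ideograph {
                words.push(&text[i..i + c.len_utf8()]);
            }
        }
    }
    // Flush a run that reaches the end of the input.
    if let Some(s) = start {
        words.push(&text[s..]);
    }
    words
}
```

Splitting ideographs one character at a time is the reason a language-specific tokenizer is usually the better choice for Chinese or Japanese text: a dictionary-based tokenizer can recover multi-character words that a boundary-only approach cannot.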
Implementations§
Trait Implementations§
impl Default for UnicodeSegmenter
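Because the type is a unit struct that implements `Default`, it can be constructed without arguments. The sketch below uses a local stand-in definition, since the crate path is not shown on this page; the derived `Debug` and `PartialEq` and the helper `make_segmenter` are assumptions added only for illustration.

```rust
// Stand-in for the documented type (assumption: the real type lives in a
// crate whose path is not shown here and may derive different traits).
#[derive(Debug, Default, PartialEq)]
pub struct UnicodeSegmenter;

/// Hypothetical helper showing construction through the `Default` trait.
pub fn make_segmenter() -> UnicodeSegmenter {
    UnicodeSegmenter::default()
}
```

A unit struct carries no state, so `UnicodeSegmenter::default()` and the literal `UnicodeSegmenter` produce the same value.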
Auto Trait Implementations§
impl Freeze for UnicodeSegmenter
impl RefUnwindSafe for UnicodeSegmenter
impl Send for UnicodeSegmenter
impl Sync for UnicodeSegmenter
impl Unpin for UnicodeSegmenter
impl UnsafeUnpin for UnicodeSegmenter
impl UnwindSafe for UnicodeSegmenter
Blanket Implementations§
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.