pub struct WhitespaceTokenizer { /* private fields */ }
Simple whitespace-based tokenizer for languages such as English, Spanish, and French.
Splits text on whitespace and punctuation, so it suits languages with clear word boundaries.
Implementations
impl WhitespaceTokenizer
pub fn with_punctuation(self, include: bool) -> Self
Returns the tokenizer with punctuation handling set by the include flag.
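The builder pattern described above can be sketched with a self-contained stand-in. Everything here is an assumption made for illustration: SketchTokenizer, its include_punctuation field, and the tokenize method are hypothetical and are not taken from this crate, whose fields are private and whose tokenizing method is not shown on this page. Only the with_punctuation(self, include: bool) -> Self signature and the Default impl mirror the documented API.

```rust
/// Hypothetical stand-in for the documented type (the real fields are private).
#[derive(Debug, Clone, Default)]
pub struct SketchTokenizer {
    /// Assumed meaning of the flag: emit punctuation marks as their own tokens.
    include_punctuation: bool,
}

impl SketchTokenizer {
    /// Builder-style setter with the same shape as the documented method:
    /// it consumes the tokenizer and returns the updated value.
    pub fn with_punctuation(mut self, include: bool) -> Self {
        self.include_punctuation = include;
        self
    }

    /// Hypothetical method: split on whitespace and ASCII punctuation.
    pub fn tokenize(&self, text: &str) -> Vec<String> {
        let mut tokens = Vec::new();
        let mut current = String::new();
        for ch in text.chars() {
            if ch.is_whitespace() {
                // Whitespace ends the current word and is never emitted.
                if !current.is_empty() {
                    tokens.push(std::mem::take(&mut current));
                }
            } else if ch.is_ascii_punctuation() {
                // Punctuation also ends the current word.
                if !current.is_empty() {
                    tokens.push(std::mem::take(&mut current));
                }
                // It is emitted only when the flag is set.
                if self.include_punctuation {
                    tokens.push(ch.to_string());
                }
            } else {
                current.push(ch);
            }
        }
        // Flush the last word if the text did not end on a separator.
        if !current.is_empty() {
            tokens.push(current);
        }
        tokens
    }
}
```

In this sketch, SketchTokenizer::default().with_punctuation(true) keeps punctuation marks as separate tokens, while the default value drops them. Taking self by value lets the call chain directly off default(), which is the usual reason for this signature.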
Trait Implementations
impl Default for WhitespaceTokenizer
Auto Trait Implementations
impl Freeze for WhitespaceTokenizer
impl RefUnwindSafe for WhitespaceTokenizer
impl Send for WhitespaceTokenizer
impl Sync for WhitespaceTokenizer
impl Unpin for WhitespaceTokenizer
impl UnsafeUnpin for WhitespaceTokenizer
impl UnwindSafe for WhitespaceTokenizer
Blanket Implementations
impl<T> BorrowMut<T> for T
where
T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> Instrument for T
fn instrument(self, span: Span) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true.
Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self> otherwise.