pub struct UnicodeWordSplitter {}
Expand description
Split text into words according to the Unicode definition of what a word is. While not perfect, it should work well enough as an easy starting point.
Uses UnicodeSegmentation::split_sentence_bounds under the hood.
Implementations§
Trait Implementations§
Source§impl Clone for UnicodeWordSplitter
impl Clone for UnicodeWordSplitter
Source§fn clone(&self) -> UnicodeWordSplitter
fn clone(&self) -> UnicodeWordSplitter
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source
. Read moreSource§impl Debug for UnicodeWordSplitter
impl Debug for UnicodeWordSplitter
Source§impl Default for UnicodeWordSplitter
impl Default for UnicodeWordSplitter
Source§fn default() -> UnicodeWordSplitter
fn default() -> UnicodeWordSplitter
Returns the “default value” for a type. Read more
Source§impl Segmenter for UnicodeWordSplitter
impl Segmenter for UnicodeWordSplitter
Source§type SubdivisionIter<'a> = IntoIter<SegmentedToken<'a>>
type SubdivisionIter<'a> = IntoIter<SegmentedToken<'a>>
The iterator type returned by the
subdivide
function if it has multiple results. Read moreSource§fn subdivide<'a>(
&self,
token: SegmentedToken<'a>,
) -> UseOrSubdivide<SegmentedToken<'a>, IntoIter<SegmentedToken<'a>>> ⓘ
fn subdivide<'a>( &self, token: SegmentedToken<'a>, ) -> UseOrSubdivide<SegmentedToken<'a>, IntoIter<SegmentedToken<'a>>> ⓘ
A method that should split the given
token
into zero, one or more subtokens. Read moreAuto Trait Implementations§
impl Freeze for UnicodeWordSplitter
impl RefUnwindSafe for UnicodeWordSplitter
impl Send for UnicodeWordSplitter
impl Sync for UnicodeWordSplitter
impl Unpin for UnicodeWordSplitter
impl UnwindSafe for UnicodeWordSplitter
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more