pub struct BasicTextCleaner { /* private fields */ }
Text cleaner for removing special characters, extra whitespace, and stopwords
Implementations
impl BasicTextCleaner
pub fn new(
    remove_special_chars: bool,
    remove_stopwords: bool,
    normalize_whitespace: bool,
) -> Self
Create a new text cleaner
pub fn with_stopwords(
    remove_special_chars: bool,
    remove_stopwords: bool,
    normalize_whitespace: bool,
    stopwords: HashSet<String>,
) -> Self
Create a text cleaner with custom stopwords
pub fn add_stopwords(&mut self, words: &[&str])
Add stopwords to the cleaner
pub fn is_stopword(&self, word: &str) -> bool
Check if a word is a stopword
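The fields of BasicTextCleaner are private and this page does not describe how the methods behave internally, so the following is only a minimal sketch of how with_stopwords, add_stopwords, and is_stopword might fit together. StandInCleaner and demo are hypothetical names, not part of the crate, and the exact-match lookup is an assumption (the real type may normalize case or ship a built-in stopword list).

```rust
use std::collections::HashSet;

/// Hypothetical stand-in mirroring the documented signatures.
/// The field layout is an assumption; the real fields are private.
#[allow(dead_code)]
#[derive(Clone, Debug, Default)]
struct StandInCleaner {
    remove_special_chars: bool,
    remove_stopwords: bool,
    normalize_whitespace: bool,
    stopwords: HashSet<String>,
}

#[allow(dead_code)]
impl StandInCleaner {
    /// Same parameter list as the documented `with_stopwords`.
    fn with_stopwords(
        remove_special_chars: bool,
        remove_stopwords: bool,
        normalize_whitespace: bool,
        stopwords: HashSet<String>,
    ) -> Self {
        Self {
            remove_special_chars,
            remove_stopwords,
            normalize_whitespace,
            stopwords,
        }
    }

    /// Same signature as the documented `add_stopwords`.
    /// Assumption: words are stored exactly as given.
    fn add_stopwords(&mut self, words: &[&str]) {
        self.stopwords.extend(words.iter().map(|w| w.to_string()));
    }

    /// Same signature as the documented `is_stopword`.
    /// Assumption: exact, case-sensitive set lookup.
    fn is_stopword(&self, word: &str) -> bool {
        self.stopwords.contains(word)
    }
}

/// Builds a cleaner with a custom list, then extends it.
/// Returns (is "the" known, is "rust" known before adding, and after adding).
#[allow(dead_code)]
fn demo() -> (bool, bool, bool) {
    let custom: HashSet<String> = ["the", "a"].iter().map(|w| w.to_string()).collect();
    let mut cleaner = StandInCleaner::with_stopwords(true, true, true, custom);
    let known = cleaner.is_stopword("the");
    let before = cleaner.is_stopword("rust");
    cleaner.add_stopwords(&["rust"]);
    let after = cleaner.is_stopword("rust");
    (known, before, after)
}
```

Because add_stopwords takes &mut self, the cleaner has to be bound with let mut before the list can be extended, while is_stopword only needs a shared reference.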
Trait Implementations
impl Clone for BasicTextCleaner
fn clone(&self) -> BasicTextCleaner
Returns a duplicate of the value.
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
impl Debug for BasicTextCleaner
impl Default for BasicTextCleaner
Auto Trait Implementations
impl Freeze for BasicTextCleaner
impl RefUnwindSafe for BasicTextCleaner
impl Send for BasicTextCleaner
impl Sync for BasicTextCleaner
impl Unpin for BasicTextCleaner
impl UnsafeUnpin for BasicTextCleaner
impl UnwindSafe for BasicTextCleaner
Blanket Implementations
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.
impl<T> Pointable for T
impl<SS, SP> SupersetOf<SS> for SP
where
    SS: SubsetOf<SP>,
fn to_subset(&self) -> Option<SS>
The inverse inclusion map: attempts to construct self from the equivalent element of its superset.
fn is_in_subset(&self) -> bool
Checks if self is actually part of its subset T (and can be converted to it).
fn to_subset_unchecked(&self) -> SS
Use with care! Same as self.to_subset but without any property checks. Always succeeds.
fn from_subset(element: &SS) -> SP
The inclusion map: converts self to the equivalent element of its superset.