pub struct TextPreprocessor {
    pub lowercase: bool,
    pub remove_stopwords: bool,
    pub lemmatize: bool,
    pub stem: bool,
    pub stopwords: Vec<String>,
}
Text preprocessing for NLP
Fields§
lowercase: bool
Whether to convert to lowercase
remove_stopwords: bool
Whether to remove stopwords
lemmatize: bool
Whether to lemmatize
stem: bool
Whether to stem
stopwords: Vec<String>
Custom stopwords
Implementations§
impl TextPreprocessor
pub fn with_lowercase(self, lowercase: bool) -> Self
Sets whether to convert to lowercase
pub fn with_remove_stopwords(self, remove_stopwords: bool) -> Self
Sets whether to remove stopwords
pub fn with_lemmatize(self, lemmatize: bool) -> Self
Sets whether to lemmatize
pub fn with_stopwords(self, stopwords: Vec<String>) -> Self
Sets custom stopwords
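The `with_*` methods above take `self` by value and return `Self`, so they can be chained. The sketch below illustrates that pattern on a local stand-in type that mirrors the documented fields and signatures; the method bodies and the `chained_config` helper are assumptions for illustration, not the crate's source. No constructor and no `with_stem` appear on this page, so the sketch starts from a struct literal and sets `stem` directly.

```rust
// Stand-in that mirrors the documented fields and `with_*` signatures.
// In real code, import the crate's `TextPreprocessor` instead of redefining it.
#[derive(Clone, Debug)]
pub struct TextPreprocessor {
    pub lowercase: bool,
    pub remove_stopwords: bool,
    pub lemmatize: bool,
    pub stem: bool,
    pub stopwords: Vec<String>,
}

impl TextPreprocessor {
    // Each setter consumes `self` and returns it, so calls can be chained.
    // The bodies are assumed; only the signatures come from the page.
    pub fn with_lowercase(mut self, lowercase: bool) -> Self {
        self.lowercase = lowercase;
        self
    }

    pub fn with_remove_stopwords(mut self, remove_stopwords: bool) -> Self {
        self.remove_stopwords = remove_stopwords;
        self
    }

    pub fn with_lemmatize(mut self, lemmatize: bool) -> Self {
        self.lemmatize = lemmatize;
        self
    }

    pub fn with_stopwords(mut self, stopwords: Vec<String>) -> Self {
        self.stopwords = stopwords;
        self
    }
}

// Hypothetical helper: builds an all-off value with a struct literal
// (possible because every field is `pub`) and then chains the setters.
pub fn chained_config() -> TextPreprocessor {
    let base = TextPreprocessor {
        lowercase: false,
        remove_stopwords: false,
        lemmatize: false,
        stem: false, // no `with_stem` is listed, so `stem` is set directly
        stopwords: Vec::new(),
    };
    base.with_lowercase(true)
        .with_remove_stopwords(true)
        .with_stopwords(vec!["the".to_string(), "of".to_string()])
}
```

Because each setter moves the value, the result of the chain has to be bound to a new variable (or returned), as `chained_config` does.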
pub fn preprocess(&self, text: &str) -> Result<String>
Preprocesses text
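The page does not say how `preprocess` tokenizes text, in which order the flags are applied, or when it returns an error. As a rough sketch only, the stand-in below assumes whitespace tokenization, lowercasing before stopword removal, and exact-match stopword filtering. It ignores `lemmatize` and `stem`, and it uses `Result<String, String>` in place of the crate's own `Result` alias, whose error type is not shown here. The `demo` function is hypothetical.

```rust
// Stand-in mirroring the documented fields; not the crate's actual type.
pub struct TextPreprocessor {
    pub lowercase: bool,
    pub remove_stopwords: bool,
    pub lemmatize: bool,
    pub stem: bool,
    pub stopwords: Vec<String>,
}

impl TextPreprocessor {
    // Assumed behaviour: lowercase first, then drop stopwords token by token.
    // `lemmatize` and `stem` are ignored because their algorithms are not
    // described on this page.
    pub fn preprocess(&self, text: &str) -> Result<String, String> {
        let normalized = if self.lowercase {
            text.to_lowercase()
        } else {
            text.to_string()
        };

        let tokens: Vec<&str> = normalized
            .split_whitespace()
            .filter(|tok| {
                // Keep the token unless stopword removal is on and it matches.
                !(self.remove_stopwords
                    && self.stopwords.iter().any(|s| s.as_str() == *tok))
            })
            .collect();

        Ok(tokens.join(" "))
    }
}

// Hypothetical usage: lowercases the input and removes two custom stopwords.
pub fn demo() -> Result<String, String> {
    let pre = TextPreprocessor {
        lowercase: true,
        remove_stopwords: true,
        lemmatize: false,
        stem: false,
        stopwords: vec!["the".to_string(), "over".to_string()],
    };
    pre.preprocess("The quick fox jumps over the dog")
}
```

Under these assumptions, `demo()` yields `Ok("quick fox jumps dog")`. The real method may treat punctuation, casing of the stopword list, and token order differently.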
Trait Implementations§
impl Clone for TextPreprocessor
fn clone(&self) -> TextPreprocessor
Returns a duplicate of the value.
1.0.0 · fn clone_from(&mut self, source: &Self)
Performs copy-assignment from source.
Auto Trait Implementations§
impl Freeze for TextPreprocessor
impl RefUnwindSafe for TextPreprocessor
impl Send for TextPreprocessor
impl Sync for TextPreprocessor
impl Unpin for TextPreprocessor
impl UnwindSafe for TextPreprocessor
Blanket Implementations§
impl<T> BorrowMut<T> for T
where
    T: ?Sized,
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value.
impl<T> CloneToUninit for T
where
    T: Clone,
impl<T> IntoEither for T
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.
impl<T> Pointable for T
impl<SS, SP> SupersetOf<SS> for SP
where
    SS: SubsetOf<SP>,
fn to_subset(&self) -> Option<SS>
The inverse inclusion map: attempts to construct self from the equivalent element of its superset.
fn is_in_subset(&self) -> bool
Checks if self is actually part of its subset T (and can be converted to it).
fn to_subset_unchecked(&self) -> SS
Use with care! Same as self.to_subset but without any property checks. Always succeeds.
fn from_subset(element: &SS) -> SP
The inclusion map: converts self to the equivalent element of its superset.