pub struct FrequencyFilter {
pub min_count: usize,
pub max_count: Option<usize>,
pub max_freq: Option<f64>,
/* private fields */
}Expand description
Filter tokens by frequency in a corpus
Fields§
§min_count: usizeMinimum token frequency
max_count: Option<usize>Maximum token frequency (absolute count)
max_freq: Option<f64>Maximum token frequency (as a fraction of total)
Implementations§
Source§impl FrequencyFilter
impl FrequencyFilter
Sourcepub fn from_tokens_with_vocabulary(
tokens: &[String],
vocabulary: &Vocabulary,
min_count: usize,
) -> Self
pub fn from_tokens_with_vocabulary( tokens: &[String], vocabulary: &Vocabulary, min_count: usize, ) -> Self
Create a new frequency filter from tokens with a vocabulary for reference
Sourcepub fn from_counts(
_token_counts: HashMap<String, usize>,
mincount: usize,
) -> Self
pub fn from_counts( _token_counts: HashMap<String, usize>, mincount: usize, ) -> Self
Create a new frequency filter from token counts
Sourcepub fn learn_from_corpus(
texts: &[&str],
tokenizer: &dyn Tokenizer,
min_count: usize,
) -> Result<Self>
pub fn learn_from_corpus( texts: &[&str], tokenizer: &dyn Tokenizer, min_count: usize, ) -> Result<Self>
Learn token frequencies from a corpus
Sourcepub fn with_max_count(self, maxcount: usize) -> Self
pub fn with_max_count(self, maxcount: usize) -> Self
Set the maximum count threshold
Sourcepub fn with_max_freq(self, maxfreq: f64) -> Result<Self>
pub fn with_max_freq(self, maxfreq: f64) -> Result<Self>
Set the maximum frequency threshold (0.0 to 1.0)
Trait Implementations§
Source§impl Clone for FrequencyFilter
impl Clone for FrequencyFilter
Source§fn clone(&self) -> FrequencyFilter
fn clone(&self) -> FrequencyFilter
Returns a duplicate of the value. Read more
1.0.0 · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for FrequencyFilter
impl Debug for FrequencyFilter
Auto Trait Implementations§
impl Freeze for FrequencyFilter
impl RefUnwindSafe for FrequencyFilter
impl Send for FrequencyFilter
impl Sync for FrequencyFilter
impl Unpin for FrequencyFilter
impl UnsafeUnpin for FrequencyFilter
impl UnwindSafe for FrequencyFilter
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§impl<T> Pointable for T
impl<T> Pointable for T
Source§impl<SS, SP> SupersetOf<SS> for SPwhere
SS: SubsetOf<SP>,
impl<SS, SP> SupersetOf<SS> for SPwhere
SS: SubsetOf<SP>,
Source§fn to_subset(&self) -> Option<SS>
fn to_subset(&self) -> Option<SS>
The inverse inclusion map: attempts to construct
self from the equivalent element of its
superset. Read moreSource§fn is_in_subset(&self) -> bool
fn is_in_subset(&self) -> bool
Checks if
self is actually part of its subset T (and can be converted to it).Source§fn to_subset_unchecked(&self) -> SS
fn to_subset_unchecked(&self) -> SS
Use with care! Same as
self.to_subset but without any property checks. Always succeeds.Source§fn from_subset(element: &SS) -> SP
fn from_subset(element: &SS) -> SP
The inclusion map: converts
self to the equivalent element of its superset.