Struct WordFreq

pub struct WordFreq { /* private fields */ }

Implementation of wordfreq.

Implementations§

impl WordFreq

pub fn new<I, W>(word_weights: I) -> Self
where I: IntoIterator<Item = (W, Float)>, W: AsRef<str>,

Creates an instance from frequencies.

§Arguments
  • word_weights: Pairs of words and their frequencies (or probabilities) from a corpus.
§Notes

If the input contains duplicate words, the last occurrence is used.
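
The note on duplicates can be illustrated with a std-only sketch. `normalize` is a hypothetical helper, not part of this crate, and `f64` stands in for `Float`. It assumes that weights are scaled by the sum of the entries that survive deduplication; the crate may compute the total differently.

```rust
use std::collections::HashMap;

/// Hypothetical helper (not the crate's code): collects (word, weight) pairs
/// into a map and scales the weights so that they sum to 1.0.
fn normalize<I, W>(word_weights: I) -> HashMap<String, f64>
where
    I: IntoIterator<Item = (W, f64)>,
    W: AsRef<str>,
{
    // Collecting into a HashMap keeps the last value seen for each key,
    // so a later duplicate overwrites an earlier one.
    let mut map: HashMap<String, f64> = word_weights
        .into_iter()
        .map(|(word, weight)| (word.as_ref().to_string(), weight))
        .collect();

    // Assumption: the total is taken over the surviving entries only.
    let total: f64 = map.values().sum();
    if total > 0.0 {
        for weight in map.values_mut() {
            *weight /= total;
        }
    }
    map
}

fn main() {
    // "las" appears twice; only the second weight (10.0) is kept.
    let freqs = normalize(vec![("las", 99.0), ("vegas", 30.0), ("las", 10.0)]);
    println!("las = {:?}, vegas = {:?}", freqs.get("las"), freqs.get("vegas"));
}
```

Under these assumptions the duplicate input yields the same frequencies as the `[("las", 10.), ("vegas", 30.)]` input used in the examples on this page.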

pub fn minimum(self, minimum: Float) -> Result<Self>

Sets the lower bound of returned frequencies (default is 0.0).

An error is returned if the input is negative.
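
A minimal sketch of how such a lower bound might behave, using only the standard library. `check_minimum` and `lookup` are hypothetical names, `f64` stands in for `Float`, and clamping with `max` is an assumption about the semantics rather than the crate's confirmed implementation.

```rust
use std::collections::HashMap;

/// Hypothetical stand-in for the builder's validation: rejects negative bounds.
fn check_minimum(minimum: f64) -> Result<f64, String> {
    if minimum < 0.0 {
        Err(format!("minimum must be non-negative, got {}", minimum))
    } else {
        Ok(minimum)
    }
}

/// Hypothetical lookup: unknown words count as 0.0, and the result is
/// never smaller than `minimum`.
fn lookup(map: &HashMap<String, f64>, word: &str, minimum: f64) -> f64 {
    map.get(word).copied().unwrap_or(0.0).max(minimum)
}

fn main() {
    let minimum = check_minimum(0.01).expect("non-negative bound");
    let mut map = HashMap::new();
    map.insert("las".to_string(), 0.25);

    // A known word keeps its frequency; an unknown word is raised to the bound.
    println!("{}", lookup(&map, "las", minimum));
    println!("{}", lookup(&map, "reno", minimum));
}
```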

pub fn standardizer(self, standardizer: Standardizer) -> Self

Sets the standardizer for preprocessing words.

If set, the standardizer is always applied to words before looking up their frequencies.

pub fn word_frequency<W>(&self, word: W) -> Float
where W: AsRef<str>,

Returns the word’s frequency, normalized between 0.0 and 1.0.

§Examples
use approx::assert_relative_eq;
use wordfreq::WordFreq;

let word_weights = [("las", 10.), ("vegas", 30.)];
let wf = WordFreq::new(word_weights);

assert_relative_eq!(wf.word_frequency("las"), 0.25);
assert_relative_eq!(wf.word_frequency("vegas"), 0.75);
assert_relative_eq!(wf.word_frequency("Las"), 0.00);

pub fn zipf_frequency<W>(&self, word: W) -> Float
where W: AsRef<str>,

Returns the Zipf frequency of a word, a human-friendly logarithmic scale.

§Examples
use approx::assert_relative_eq;
use wordfreq::WordFreq;

let word_weights = [("las", 10.), ("vegas", 30.)];
let wf = WordFreq::new(word_weights);

assert_relative_eq!(wf.zipf_frequency("las"), 8.4);
assert_relative_eq!(wf.zipf_frequency("vegas"), 8.88);
assert_relative_eq!(wf.zipf_frequency("Las"), 0.00);
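
The Zipf scale is the base-10 logarithm of a word's occurrences per billion words, that is, log10(frequency) + 9. The values in the example above are consistent with that formula rounded to two decimals, which the following std-only sketch reproduces. `freq_to_zipf` is a hypothetical helper; the clamp at 1e-9 and the rounding are assumptions inferred from the example values, not the crate's confirmed code, and `f64` stands in for `Float`.

```rust
/// Hypothetical helper: converts a frequency in [0, 1] to the Zipf scale.
fn freq_to_zipf(freq: f64) -> f64 {
    // Assumption: clamp to one occurrence per billion words so that
    // unseen words map to 0.0 instead of negative infinity.
    let clamped = freq.max(1e-9);
    // log10 of the frequency per billion words.
    let zipf = clamped.log10() + 9.0;
    // Assumption: round to two decimal places.
    (zipf * 100.0).round() / 100.0
}

fn main() {
    for freq in [0.25, 0.75, 0.0] {
        println!("{} -> {}", freq, freq_to_zipf(freq));
    }
}
```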

pub fn serialize(&self) -> Result<Vec<u8>>

Exports the model data.

Note that the format differs from the one used in the original Python package.

pub fn deserialize(bytes: &[u8]) -> Result<Self>

Deserializes a model exported by WordFreq::serialize().

pub const fn word_frequency_map(&self) -> &HashMap<String, Float>

Returns a reference to the internal word-frequency map.

Trait Implementations§

impl Clone for WordFreq

fn clone(&self) -> WordFreq

Returns a copy of the value.

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

Auto Trait Implementations§

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dst: *mut u8)

🔬 This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dst.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.