pub struct Ngrams { /* private fields */ }Expand description
An n-gram counter for counting n-gram frequencies.
Accumulates n-gram counts from sequences of elements. N-grams do not cross sequence boundaries.
Implementations§
Source§impl Ngrams
impl Ngrams
Sourcepub fn new(n: usize, min_n: Option<usize>) -> PyResult<Self>
pub fn new(n: usize, min_n: Option<usize>) -> PyResult<Self>
Create a new empty Ngrams.
§Arguments
n- The n-gram order (1 for unigrams, 2 for bigrams, etc.). Must be >= 1.
Sourcepub fn count(&mut self, seq: Vec<String>)
pub fn count(&mut self, seq: Vec<String>)
Count n-grams from a single sequence.
Extracts all n-grams of the configured order from the sequence and increments their counts. N-grams do not cross sequence boundaries.
Sourcepub fn count_seqs(&mut self, seqs: Vec<Vec<String>>)
pub fn count_seqs(&mut self, seqs: Vec<Vec<String>>)
Count n-grams from multiple sequences.
Each sequence is treated independently (n-grams do not cross boundaries).
Sourcepub fn get(&self, ngram: Vec<String>) -> u64
pub fn get(&self, ngram: Vec<String>) -> u64
Return the count for a specific n-gram.
Returns 0 if the n-gram has not been observed.
Sourcepub fn most_common(
&self,
py: Python<'_>,
n: Option<usize>,
order: Option<usize>,
) -> PyResult<PyObject>
pub fn most_common( &self, py: Python<'_>, n: Option<usize>, order: Option<usize>, ) -> PyResult<PyObject>
Return the n most common n-grams with their counts.
If n is None, returns all n-grams sorted by count (descending).
Sourcepub fn items(&self, py: Python<'_>, order: Option<usize>) -> PyResult<PyObject>
pub fn items(&self, py: Python<'_>, order: Option<usize>) -> PyResult<PyObject>
Return all (n-gram, count) pairs.
Sourcepub fn total(&self, order: Option<usize>) -> PyResult<u64>
pub fn total(&self, order: Option<usize>) -> PyResult<u64>
Return the total number of n-gram tokens counted.
Trait Implementations§
Source§impl<'py> IntoPyObject<'py> for Ngrams
impl<'py> IntoPyObject<'py> for Ngrams
Source§type Output = Bound<'py, <Ngrams as IntoPyObject<'py>>::Target>
type Output = Bound<'py, <Ngrams as IntoPyObject<'py>>::Target>
The smart pointer type to use. Read more
Source§fn into_pyobject(
self,
py: Python<'py>,
) -> Result<<Self as IntoPyObject<'_>>::Output, <Self as IntoPyObject<'_>>::Error>
fn into_pyobject( self, py: Python<'py>, ) -> Result<<Self as IntoPyObject<'_>>::Output, <Self as IntoPyObject<'_>>::Error>
Performs the conversion.
Source§impl PyClassImpl for Ngrams
impl PyClassImpl for Ngrams
Source§const IS_BASETYPE: bool = false
const IS_BASETYPE: bool = false
#[pyclass(subclass)]
Source§const IS_SUBCLASS: bool = false
const IS_SUBCLASS: bool = false
#[pyclass(extends=…)]
Source§const IS_MAPPING: bool = false
const IS_MAPPING: bool = false
#[pyclass(mapping)]
Source§const IS_SEQUENCE: bool = false
const IS_SEQUENCE: bool = false
#[pyclass(sequence)]
Source§const IS_IMMUTABLE_TYPE: bool = false
const IS_IMMUTABLE_TYPE: bool = false
#[pyclass(immutable_type)]
Source§type ThreadChecker = SendablePyClass<Ngrams>
type ThreadChecker = SendablePyClass<Ngrams>
This handles following two situations: Read more
Source§type PyClassMutability = <<PyAny as PyClassBaseType>::PyClassMutability as PyClassMutability>::MutableChild
type PyClassMutability = <<PyAny as PyClassBaseType>::PyClassMutability as PyClassMutability>::MutableChild
Immutable or mutable
Source§type BaseNativeType = PyAny
type BaseNativeType = PyAny
The closest native ancestor. This is
PyAny by default, and when you declare
#[pyclass(extends=PyDict)], it’s PyDict.fn items_iter() -> PyClassItemsIter
fn lazy_type_object() -> &'static LazyTypeObject<Self>
fn dict_offset() -> Option<isize>
fn weaklist_offset() -> Option<isize>
Source§impl PyClassNewTextSignature<Ngrams> for PyClassImplCollector<Ngrams>
impl PyClassNewTextSignature<Ngrams> for PyClassImplCollector<Ngrams>
fn new_text_signature(self) -> Option<&'static str>
Source§impl PyMethods<Ngrams> for PyClassImplCollector<Ngrams>
impl PyMethods<Ngrams> for PyClassImplCollector<Ngrams>
fn py_methods(self) -> &'static PyClassItems
Source§impl PyTypeInfo for Ngrams
impl PyTypeInfo for Ngrams
Source§fn type_object_raw(py: Python<'_>) -> *mut PyTypeObject
fn type_object_raw(py: Python<'_>) -> *mut PyTypeObject
Returns the PyTypeObject instance for this type.
Source§fn type_object(py: Python<'_>) -> Bound<'_, PyType>
fn type_object(py: Python<'_>) -> Bound<'_, PyType>
Returns the safe abstraction over the type object.
impl DerefToPyAny for Ngrams
Auto Trait Implementations§
impl Freeze for Ngrams
impl RefUnwindSafe for Ngrams
impl Send for Ngrams
impl Sync for Ngrams
impl Unpin for Ngrams
impl UnsafeUnpin for Ngrams
impl UnwindSafe for Ngrams
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more
Source§impl<T> CloneToUninit for Twhere
T: Clone,
impl<T> CloneToUninit for Twhere
T: Clone,
Source§impl<T> FromPyObject<'_> for T
impl<T> FromPyObject<'_> for T
Source§impl<'py, T> FromPyObjectBound<'_, 'py> for Twhere
T: FromPyObject<'py>,
impl<'py, T> FromPyObjectBound<'_, 'py> for Twhere
T: FromPyObject<'py>,
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
Converts
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§impl<'py, T> IntoPyObjectExt<'py> for Twhere
T: IntoPyObject<'py>,
impl<'py, T> IntoPyObjectExt<'py> for Twhere
T: IntoPyObject<'py>,
Source§fn into_bound_py_any(self, py: Python<'py>) -> Result<Bound<'py, PyAny>, PyErr>
fn into_bound_py_any(self, py: Python<'py>) -> Result<Bound<'py, PyAny>, PyErr>
Converts
self into an owned Python object, dropping type information.