Struct NormalizationRustStemmers

pub struct NormalizationRustStemmers {
    pub anyway_above_confidence: f64,
}

Stems each token using the language tagged onto it, provided a stemming algorithm is available for that language.

This uses the rust_stemmers crate under the hood.

Run this after an AugmentationDetectLanguage has been applied; it does nothing if no language metadata is available!

Tokens are ignored if:

They are known not to be a SegmentedTokenKind::AlphaNumeric.
They already have normalized_text set. Apply normalizations such as lowercasing after this one.
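
The two skip rules above can be sketched as a small predicate. This is only an illustration: `Kind`, `Token`, and `is_skipped` are hypothetical stand-ins rather than the crate's real types, and it assumes that a token whose kind is still unknown is not skipped, which is how "known not to be" is read here.

```rust
// Hypothetical stand-ins for illustration; the crate's real token types differ.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Kind {
    AlphaNumeric,
    Punctuation,
}

#[derive(Debug, Default)]
struct Token {
    /// `None` means the kind has not been determined yet (assumption).
    kind: Option<Kind>,
    /// Set once some normalizer has produced output for this token.
    normalized_text: Option<String>,
}

/// Returns true if a stemmer following the rules above would leave the token untouched.
fn is_skipped(token: &Token) -> bool {
    // Rule 1: the kind is known and it is something other than AlphaNumeric.
    let known_non_alphanumeric = matches!(token.kind, Some(k) if k != Kind::AlphaNumeric);
    // Rule 2: an earlier normalizer already filled in normalized_text.
    known_non_alphanumeric || token.normalized_text.is_some()
}
```

Under this reading, a token that was lowercased into normalized_text before stemming would be skipped, which is why the ordering advice matters.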

Fields

anyway_above_confidence: f64

Threshold above which the language detector's own reliability flag is ignored and the detected language is used for normalization anyway. Setting this can help with shorter texts.

A value of 1.0 means the flag is never ignored; 0.0 means it is always ignored.

The default is 0.4, as that is usually “good enough” for correct stemming.
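
As a rough sketch of how such a threshold could be applied: the function and parameter names below (`use_detected_language`, `is_reliable`, `confidence`) are hypothetical, the comparison is assumed to be strict, and the confidence is assumed to lie between 0.0 and 1.0. The crate's actual check may differ.

```rust
/// Decide whether the detected language should be used for stemming.
///
/// `is_reliable` and `confidence` are hypothetical names for the language
/// detector's output; `confidence` is assumed to lie in 0.0..=1.0.
fn use_detected_language(is_reliable: bool, confidence: f64, anyway_above_confidence: f64) -> bool {
    // A reliable detection is always used. An unreliable one is used only
    // when its confidence is strictly above the configured threshold.
    is_reliable || confidence > anyway_above_confidence
}
```

With a strict comparison, a threshold of 1.0 can never be exceeded, so the reliability flag always decides, while a threshold of 0.0 lets any positive confidence through.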

Implementations

impl NormalizationRustStemmers

pub fn new() -> Self

Create a new NormalizationRustStemmers instance with the default settings.

pub fn set_anyway_above_confidence(self, anyway_above_confidence: f64) -> Self

Sets the value of anyway_above_confidence, builder style.
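
To show what "builder style" means for the two methods above, here is a minimal sketch using a local stand-in type. It mirrors the documented signatures and the documented default of 0.4, but the bodies are assumptions and not the crate's source; `stemmer_for_short_texts` and the value 0.2 are purely illustrative.

```rust
// Local stand-in mirroring the documented signatures; not the crate's actual source.
#[derive(Debug, Clone)]
pub struct NormalizationRustStemmers {
    pub anyway_above_confidence: f64,
}

impl Default for NormalizationRustStemmers {
    fn default() -> Self {
        // 0.4 is the documented default threshold.
        Self { anyway_above_confidence: 0.4 }
    }
}

impl NormalizationRustStemmers {
    /// Assumed to be equivalent to `Default::default()`.
    pub fn new() -> Self {
        Self::default()
    }

    /// Consumes `self` and returns it with the new threshold, so calls can be chained.
    pub fn set_anyway_above_confidence(mut self, anyway_above_confidence: f64) -> Self {
        self.anyway_above_confidence = anyway_above_confidence;
        self
    }
}

/// Hypothetical helper: a configuration that trusts the detector more on short texts.
fn stemmer_for_short_texts() -> NormalizationRustStemmers {
    NormalizationRustStemmers::new().set_anyway_above_confidence(0.2)
}
```

Because the setter takes `self` by value and returns `Self`, it can be chained directly onto `new()` without a mutable binding.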

Trait Implementations

impl Augmenter for NormalizationRustStemmers

fn augment<'a>(&self, token: SegmentedToken<'a>) -> SegmentedToken<'a>

Applies the augmentation function to the given token and returns it.

impl Clone for NormalizationRustStemmers

fn clone(&self) -> NormalizationRustStemmers

Returns a duplicate of the value.

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl Debug for NormalizationRustStemmers

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl Default for NormalizationRustStemmers

fn default() -> Self

Returns the “default value” for a type.

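Auto Trait Implementations

impl Freeze for NormalizationRustStemmers

impl RefUnwindSafe for NormalizationRustStemmers

impl Send for NormalizationRustStemmers

impl Sync for NormalizationRustStemmers

impl Unpin for NormalizationRustStemmers

impl UnwindSafe for NormalizationRustStemmers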

Blanket Implementations

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬 This is a nightly-only experimental API. (clone_to_uninit)

Performs copy-assignment from self to dest.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> Segmenter for T
where T: Augmenter,

type SubdivisionIter<'a> = IntoIter<SegmentedToken<'a>>

The iterator type returned by the subdivide function if it has multiple results.

fn subdivide<'a>(&self, token: SegmentedToken<'a>) -> UseOrSubdivide<SegmentedToken<'a>, <T as Segmenter>::SubdivisionIter<'a>>

A method that should split the given token into zero, one or more subtokens.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
