Winsorizer

Struct Winsorizer 

pub struct Winsorizer<State = Untrained> { /* private fields */ }

Winsorizer for capping extreme outliers

This transformer limits extreme values by replacing outliers with percentile values. For example, with lower_percentile=5 and upper_percentile=95, all values below the 5th percentile are set to the 5th percentile value, and all values above the 95th percentile are set to the 95th percentile value.
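The clamping described above can be sketched on a single column in plain Rust. This is a minimal illustration, not the library's implementation: the helper names `percentile` and `winsorize` are hypothetical, the linear interpolation between neighbouring ranks is an assumption (the interpolation rule used by `Winsorizer` is not stated here), and the input is assumed to be non-empty and free of NaN values.

```rust
/// Linearly interpolated percentile of a sorted, non-empty slice.
/// `p` is given in percent (0.0..=100.0). The interpolation rule is an
/// assumption made for this sketch.
fn percentile(sorted: &[f64], p: f64) -> f64 {
    let rank = p / 100.0 * (sorted.len() - 1) as f64;
    let lo = rank.floor() as usize;
    let hi = rank.ceil() as usize;
    let frac = rank - lo as f64;
    sorted[lo] + (sorted[hi] - sorted[lo]) * frac
}

/// Clamp every value to the [lower, upper] percentile range of the data.
fn winsorize(values: &[f64], lower: f64, upper: f64) -> Vec<f64> {
    let mut sorted = values.to_vec();
    sorted.sort_by(|a, b| a.total_cmp(b));
    let lo = percentile(&sorted, lower);
    let hi = percentile(&sorted, upper);
    values.iter().map(|v| v.clamp(lo, hi)).collect()
}

fn main() {
    let data = [1.0, 2.0, 3.0, 4.0, 100.0];
    // With these five points the 25th percentile is 2.0 and the 75th is 4.0.
    let out = winsorize(&data, 25.0, 75.0);
    println!("{:?}", out); // prints [2.0, 2.0, 3.0, 4.0, 4.0]
}
```

Note that the outlier 100.0 is pulled in to the upper bound rather than dropped, so the number of observations is unchanged.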

Implementations

impl Winsorizer<Untrained>

pub fn new() -> Self

Create a new Winsorizer with default configuration

pub fn with_percentiles(lower: Float, upper: Float) -> Self

Create a Winsorizer with specified percentiles

pub fn with_iqr(multiplier: Float) -> Self

Create a Winsorizer with IQR-based bounds
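How the IQR multiplier turns into bounds is not spelled out here. A common convention, assumed in the sketch below, is the Tukey fence: lower = Q1 - k * IQR and upper = Q3 + k * IQR, where IQR = Q3 - Q1 and k is the multiplier. The helper names `quantile` and `iqr_bounds` are hypothetical, and the quartiles use linear interpolation, which may differ from what `with_iqr` does internally.

```rust
/// Linearly interpolated quantile (`q` in 0.0..=1.0) of a sorted,
/// non-empty slice. The interpolation rule is an assumption.
fn quantile(sorted: &[f64], q: f64) -> f64 {
    let rank = q * (sorted.len() - 1) as f64;
    let lo = rank.floor() as usize;
    let hi = rank.ceil() as usize;
    sorted[lo] + (sorted[hi] - sorted[lo]) * (rank - lo as f64)
}

/// Tukey-style fences: (Q1 - k * IQR, Q3 + k * IQR).
fn iqr_bounds(values: &[f64], multiplier: f64) -> (f64, f64) {
    let mut sorted = values.to_vec();
    sorted.sort_by(|a, b| a.total_cmp(b));
    let q1 = quantile(&sorted, 0.25);
    let q3 = quantile(&sorted, 0.75);
    let iqr = q3 - q1;
    (q1 - multiplier * iqr, q3 + multiplier * iqr)
}

fn main() {
    let data = [1.0, 2.0, 3.0, 4.0, 100.0];
    // Q1 = 2, Q3 = 4, IQR = 2, so a multiplier of 1.5 gives fences -1 and 7.
    let (lo, hi) = iqr_bounds(&data, 1.5);
    println!("{lo} {hi}");
}
```

Under this convention a larger multiplier widens the fences and clips fewer values.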

pub fn lower_percentile(self, percentile: Float) -> Self

Set the lower percentile

pub fn upper_percentile(self, percentile: Float) -> Self

Set the upper percentile

pub fn feature_wise(self, feature_wise: bool) -> Self

Set whether to winsorize features independently

pub fn nan_strategy(self, strategy: NanStrategy) -> Self

Set the NaN handling strategy

impl Winsorizer<Trained>

pub fn lower_bounds(&self) -> &Array1<Float>

Get the lower bounds

pub fn upper_bounds(&self) -> &Array1<Float>

Get the upper bounds

pub fn n_features_in(&self) -> usize

Get the number of features seen during fitting

pub fn winsorize_single(&self, feature_idx: usize, value: Float) -> Float

Apply winsorization to a single value for a specific feature
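Conceptually, winsorizing one value means clamping it to the bounds stored for its feature. The sketch below uses a hypothetical stand-in struct `Bounds` with plain `Vec<f64>` fields instead of the fitted `Winsorizer` and its `Array1<Float>` bounds. How the real method treats an out-of-range `feature_idx` or a NaN input is not stated here; this sketch panics on a bad index and passes NaN through unchanged.

```rust
/// Hypothetical stand-in for the fitted state: one bound pair per feature.
struct Bounds {
    lower: Vec<f64>,
    upper: Vec<f64>,
}

impl Bounds {
    /// Clamp `value` to the bounds stored for column `feature_idx`.
    /// Panics if `feature_idx` is out of range (a choice made for this sketch).
    fn winsorize_single(&self, feature_idx: usize, value: f64) -> f64 {
        value.clamp(self.lower[feature_idx], self.upper[feature_idx])
    }
}

fn main() {
    let bounds = Bounds {
        lower: vec![0.0, -5.0],
        upper: vec![10.0, 5.0],
    };
    assert_eq!(bounds.winsorize_single(0, 42.0), 10.0); // above the upper bound
    assert_eq!(bounds.winsorize_single(1, -9.0), -5.0); // below the lower bound
    assert_eq!(bounds.winsorize_single(1, 1.5), 1.5); // inside the range, unchanged
}
```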

pub fn get_winsorization_stats(&self, x: &Array2<Float>) -> Result<WinsorizationStats>

Get statistics about the winsorization applied to the data

Trait Implementations

impl<State: Clone> Clone for Winsorizer<State>

fn clone(&self) -> Winsorizer<State>

Returns a duplicate of the value.

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl<State: Debug> Debug for Winsorizer<State>

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl Default for Winsorizer<Untrained>

fn default() -> Self

Returns the “default value” for a type.

impl Fit<ArrayBase<OwnedRepr<f64>, Dim<[usize; 2]>>, ()> for Winsorizer<Untrained>

type Fitted = Winsorizer<Trained>

The fitted model type

fn fit(self, x: &Array2<Float>, _y: &()) -> Result<Self::Fitted>

Fit the model to the provided data with validation

fn fit_with_validation(self, x: &X, y: &Y, _x_val: Option<&X>, _y_val: Option<&Y>) -> Result<(Self::Fitted, FitMetrics), SklearsError>
where Self: Sized,

Fit with custom validation and early stopping
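The signatures above follow a typestate pattern: `fit` consumes a `Winsorizer<Untrained>` and returns a `Winsorizer<Trained>`, and only the trained type exposes `transform` and the bound accessors. The following self-contained sketch illustrates that pattern with a hypothetical miniature type `Clipper` that clips to the observed minimum and maximum. The state markers are local stand-ins, and storing the bounds inside the `Trained` marker is a choice made for this sketch, not a description of the library's private fields.

```rust
// Local stand-ins for the library's state markers.
struct Untrained;
struct Trained {
    lower: f64,
    upper: f64,
}

// Hypothetical miniature transformer; defaults to the untrained state.
struct Clipper<State = Untrained> {
    state: State,
}

impl Clipper<Untrained> {
    fn new() -> Self {
        Clipper { state: Untrained }
    }

    /// Consumes the untrained value and returns a trained one.
    fn fit(self, x: &[f64]) -> Result<Clipper<Trained>, String> {
        if x.is_empty() {
            return Err("cannot fit on empty data".to_string());
        }
        let lower = x.iter().copied().fold(f64::INFINITY, f64::min);
        let upper = x.iter().copied().fold(f64::NEG_INFINITY, f64::max);
        Ok(Clipper { state: Trained { lower, upper } })
    }
}

impl Clipper<Trained> {
    /// Only available after fitting: clamps new data to the learned bounds.
    fn transform(&self, x: &[f64]) -> Vec<f64> {
        x.iter()
            .map(|v| v.clamp(self.state.lower, self.state.upper))
            .collect()
    }
}

fn main() {
    let fitted = Clipper::new().fit(&[1.0, 5.0, 9.0]).unwrap();
    // Calling `transform` on `Clipper::new()` directly would not compile,
    // because the method exists only on `Clipper<Trained>`.
    println!("{:?}", fitted.transform(&[-3.0, 4.0, 20.0])); // prints [1.0, 4.0, 9.0]
}
```

The benefit of this design is that using an unfitted transformer is rejected at compile time instead of surfacing as a runtime error.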

impl Transform<ArrayBase<OwnedRepr<f64>, Dim<[usize; 2]>>> for Winsorizer<Trained>

fn transform(&self, x: &Array2<Float>) -> Result<Array2<Float>>

Transform the input data

Auto Trait Implementations

impl<State> Freeze for Winsorizer<State>

impl<State> RefUnwindSafe for Winsorizer<State>
where State: RefUnwindSafe,

impl<State> Send for Winsorizer<State>
where State: Send,

impl<State> Sync for Winsorizer<State>
where State: Sync,

impl<State> Unpin for Winsorizer<State>
where State: Unpin,

impl<State> UnwindSafe for Winsorizer<State>
where State: UnwindSafe,

Blanket Implementations

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.

impl<T> Pointable for T

const ALIGN: usize

The alignment of pointer.

type Init = T

The type for initializers.

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a value with the given initializer.

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer.

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer.

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

fn vzip(self) -> V