Binarizer

Struct Binarizer 

Source
pub struct Binarizer<F> { /* private fields */ }

A stateless feature binarizer.

Values strictly greater than threshold become 1.0; all other values become 0.0. The default threshold is 0.0.

This transformer is stateless, so no fitting is needed: call Transform::transform directly. Fit::fit is also implemented, but it only validates its input and learns no statistics.

§Examples

use ferrolearn_preprocess::binarizer::Binarizer;
use ferrolearn_core::traits::Transform;
use ndarray::array;

let binarizer = Binarizer::<f64>::new(0.5);
let x = array![[0.0, 0.5, 1.0]];
let out = binarizer.transform(&x).unwrap();
// out = [[0.0, 0.0, 1.0]]
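
The strict comparison is the detail most easily missed: a value equal to the threshold maps to 0.0, not 1.0. The following is a minimal std-only sketch of that rule, assuming nested Vec rows in place of an ndarray matrix. The function binarize_rows is a hypothetical stand-in for illustration and is not part of ferrolearn.

```rust
/// Hypothetical stand-in for the element-wise rule; not the crate's code.
/// Rows are plain `Vec<f64>` instead of an `ndarray::Array2`.
fn binarize_rows(rows: &[Vec<f64>], threshold: f64) -> Vec<Vec<f64>> {
    rows.iter()
        .map(|row| {
            row.iter()
                // Strictly greater than the threshold maps to 1.0;
                // values equal to or below it map to 0.0.
                .map(|&v| if v > threshold { 1.0 } else { 0.0 })
                .collect::<Vec<f64>>()
        })
        .collect()
}
```

Called as binarize_rows(&[vec![0.0, 0.5, 1.0]], 0.5), this sketch returns the same single row as the example above, with 0.5 itself mapped to 0.0.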

Implementations§

Source§

impl<F: Float + Send + Sync + 'static> Binarizer<F>

Source

pub fn new(threshold: F) -> Self

Create a new Binarizer with the given threshold (and the default copy = true).

sklearn constrains threshold to Interval(Real, None, None, closed="neither") on binarize (_data.py:2114-2115): an open interval (-inf, inf) that excludes NaN and ±inf. new performs no validation, so a non-finite threshold is not rejected at construction; this matches sklearn’s __init__, which stores parameters unchecked. The value is rejected later, with FerroError::InvalidParameter, by Fit::fit, Transform::transform, or binarize, matching sklearn’s _fit_context / @validate_params, which raise InvalidParameterError at fit/binarize time.
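
The deferred check can be pictured with a small std-only sketch. DeferredBinarizer, its field, and check_threshold are hypothetical names used only for illustration; the real type reports FerroError::InvalidParameter rather than a String.

```rust
/// Hypothetical stand-in for the transformer; names are assumptions.
#[derive(Debug, Clone, Copy)]
struct DeferredBinarizer {
    threshold: f64,
}

impl DeferredBinarizer {
    /// Stores the threshold unchecked, as the constructor is described to do.
    fn new(threshold: f64) -> Self {
        Self { threshold }
    }

    /// The check that is postponed until fit/transform time.
    /// `is_finite` is false for NaN, +inf, and -inf.
    fn check_threshold(&self) -> Result<(), String> {
        if self.threshold.is_finite() {
            Ok(())
        } else {
            Err(format!("threshold must be finite, got {}", self.threshold))
        }
    }
}
```

In this sketch DeferredBinarizer::new(f64::NAN) succeeds, and only the later check_threshold call returns an error, which is the ordering described above.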

Source

pub fn threshold(&self) -> F

Return the configured threshold.

Source

pub fn with_copy(self, copy: bool) -> Self

Set the copy parameter (sklearn Binarizer(copy=...), _data.py:2253, _parameter_constraints {copy:["boolean"]} :2250).

The flag is accepted and documented but is a no-op: ferrolearn’s Transform always returns a freshly allocated array, so copy has no observable effect on the output. It is retained for API parity with scikit-learn.
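
A rough std-only sketch of why the flag cannot change the result: the transform step borrows its input and allocates a new output regardless of the stored flag. CopyFlagBinarizer and its one-row transform are hypothetical stand-ins, not the crate's implementation.

```rust
/// Hypothetical stand-in showing a stored-but-unused `copy` flag.
#[derive(Debug, Clone)]
struct CopyFlagBinarizer {
    threshold: f64,
    copy: bool,
}

impl CopyFlagBinarizer {
    /// `copy` defaults to true, as stated for `new` above.
    fn new(threshold: f64) -> Self {
        Self { threshold, copy: true }
    }

    /// Builder-style setter: consumes `self` and returns the updated value.
    fn with_copy(mut self, copy: bool) -> Self {
        self.copy = copy;
        self
    }

    /// Getter for the stored flag.
    fn copy(&self) -> bool {
        self.copy
    }

    /// Borrows the input and always allocates a fresh output,
    /// so the flag is never consulted here.
    fn transform(&self, row: &[f64]) -> Vec<f64> {
        row.iter()
            .map(|&v| if v > self.threshold { 1.0 } else { 0.0 })
            .collect()
    }
}
```

Because the input is only borrowed immutably, the caller's data is left untouched whether the flag is true or false.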

Source

pub fn copy(&self) -> bool

Return the configured copy flag (sklearn Binarizer.copy).

Trait Implementations§

Source§

impl<F: Clone> Clone for Binarizer<F>

Source§

fn clone(&self) -> Binarizer<F>

Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl<F: Debug> Debug for Binarizer<F>

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl<F: Float + Send + Sync + 'static> Default for Binarizer<F>

Source§

fn default() -> Self

Returns the “default value” for a type. Read more
Source§

impl<F: Float + Send + Sync + 'static> Fit<ArrayBase<OwnedRepr<F>, Dim<[usize; 2]>>, ()> for Binarizer<F>

Source§

fn fit(&self, x: &Array2<F>, _y: &()) -> Result<FittedBinarizer<F>, FerroError>

Validate the input and record n_features_in_, returning a FittedBinarizer.

Binarizer is stateless: like scikit-learn’s Binarizer.fit (sklearn/preprocessing/_data.py:2257-2278, “Only validates estimator’s parameters”), this learns no statistics. It runs the same check_array validation as Transform::transform / binarize (REQ-9, via the shared validate_binarize_input helper) and records n_features_in_ = x.ncols().

sklearn’s _validate_data uses the default force_all_finite=True, so NaN and ±inf are rejected in fit (Binarizer().fit([[nan]]) and Binarizer().fit([[inf]]) both raise ValueError).

sklearn’s _fit_context validates _parameter_constraints (:2249) before the data, and threshold is constrained to Interval(Real, None, None, closed="neither") on binarize (_data.py:2114): an open interval (-inf, inf) that excludes NaN and ±inf. A non-finite threshold is therefore rejected here first, before the data is inspected, matching _fit_context.

§Errors

Returns FerroError::InvalidParameter if threshold is non-finite (NaN/±inf, sklearn Interval(Real, None, None, closed="neither"), _data.py:2114).

Returns FerroError::InsufficientSamples if x has zero rows.

Returns FerroError::InvalidParameter if x has zero features or contains any non-finite value (NaN, +inf, -inf).

The data checks match check_array (sklearn/utils/validation.py:1084, :1093, :1063) as routed through Binarizer.fit -> _validate_data (_data.py:2277).
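
The parameter-before-data ordering and the recorded feature count can be sketched with std only. SketchFitted and sketch_fit are hypothetical names, errors are plain strings rather than FerroError, and the zero-row and zero-feature checks are deliberately left out to keep the focus on ordering.

```rust
/// Hypothetical stand-in for the fitted type: no learned statistics,
/// only the threshold and the number of input columns.
#[derive(Debug, PartialEq)]
struct SketchFitted {
    threshold: f64,
    n_features_in: usize,
}

/// Assumption: shape checks (zero rows, zero features) are omitted here.
fn sketch_fit(threshold: f64, rows: &[Vec<f64>]) -> Result<SketchFitted, String> {
    // 1. Parameter check runs first, before the data is inspected.
    if !threshold.is_finite() {
        return Err("invalid parameter: threshold must be finite".to_string());
    }
    // 2. Data check runs second: reject NaN, +inf, and -inf.
    if rows.iter().flatten().any(|v| !v.is_finite()) {
        return Err("invalid parameter: x contains a non-finite value".to_string());
    }
    // 3. Nothing is learned; only the column count is recorded.
    let n_features_in = rows.first().map_or(0, |row| row.len());
    Ok(SketchFitted { threshold, n_features_in })
}
```

When both the threshold and the data are non-finite, this sketch reports the threshold error, because the parameter check returns before the data is scanned.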

Source§

type Fitted = FittedBinarizer<F>

The fitted model type returned by fit.
Source§

type Error = FerroError

The error type returned by fit.
Source§

impl<F: Float + Send + Sync + 'static> Transform<ArrayBase<OwnedRepr<F>, Dim<[usize; 2]>>> for Binarizer<F>

Source§

fn transform(&self, x: &Array2<F>) -> Result<Array2<F>, FerroError>

Apply the threshold: values > threshold become 1.0, others become 0.0.

§Errors

Returns FerroError::InsufficientSamples if x has zero rows. This mirrors scikit-learn’s Binarizer.transform (sklearn/preprocessing/_data.py:2301), whose _validate_data -> check_array min-samples check raises ValueError: Found array with 0 sample(s) ... while a minimum of 1 is required by Binarizer.

Returns FerroError::InvalidParameter if x has zero features (columns). This mirrors scikit-learn’s Binarizer.transform (sklearn/preprocessing/_data.py:2301), whose _validate_data -> check_array min-features check (utils/validation.py:1093, ensure_min_features=1) raises ValueError: Found array with 0 feature(s) (shape=(3, 0)) while a minimum of 1 is required by Binarizer.

Returns FerroError::InvalidParameter if x contains any non-finite value (NaN, +inf, or -inf). This mirrors scikit-learn’s Binarizer.transform (sklearn/preprocessing/_data.py:2301), which validates input via check_array(force_all_finite=True) and raises ValueError: Input X contains NaN. / Input X contains infinity ... before applying the threshold comparison.
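
The three input checks can be summarized in a std-only sketch. SketchError and check_input are hypothetical stand-ins: the variant names echo the FerroError variants named above, but their payloads are assumptions. The order of the checks relative to one another does not affect the result, since an input with zero rows or zero columns holds no values that could be non-finite.

```rust
/// Hypothetical error type echoing the two variants named above.
#[derive(Debug, PartialEq)]
enum SketchError {
    InsufficientSamples,
    InvalidParameter(&'static str),
}

/// Rows are plain `Vec<f64>` in place of an `ndarray::Array2`.
fn check_input(rows: &[Vec<f64>]) -> Result<(), SketchError> {
    // Zero rows: at least one sample is required.
    if rows.is_empty() {
        return Err(SketchError::InsufficientSamples);
    }
    // Zero columns: at least one feature is required.
    if rows.iter().any(|row| row.is_empty()) {
        return Err(SketchError::InvalidParameter("zero features"));
    }
    // Any NaN, +inf, or -inf is rejected before thresholding.
    if rows.iter().flatten().any(|v| !v.is_finite()) {
        return Err(SketchError::InvalidParameter("non-finite value"));
    }
    Ok(())
}
```

A caller can match on the returned variant to distinguish an empty batch from malformed values, much as one would match on the FerroError variants listed above.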

Source§

type Output = ArrayBase<OwnedRepr<F>, Dim<[usize; 2]>>

The transformed output type.
Source§

type Error = FerroError

The error type returned by transform.

Auto Trait Implementations§

§

impl<F> Freeze for Binarizer<F>
where F: Freeze,

§

impl<F> RefUnwindSafe for Binarizer<F>
where F: RefUnwindSafe,

§

impl<F> Send for Binarizer<F>
where F: Send,

§

impl<F> Sync for Binarizer<F>
where F: Sync,

§

impl<F> Unpin for Binarizer<F>
where F: Unpin,

§

impl<F> UnsafeUnpin for Binarizer<F>
where F: UnsafeUnpin,

§

impl<F> UnwindSafe for Binarizer<F>
where F: UnwindSafe,

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> ByRef<T> for T

Source§

fn by_ref(&self) -> &T

Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> DistributionExt for T
where T: ?Sized,

Source§

fn rand<T>(&self, rng: &mut (impl Rng + ?Sized)) -> T
where Self: Distribution<T>,

Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Imply<T> for U
where T: ?Sized, U: ?Sized,

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

impl<T> Pointable for T

Source§

const ALIGN: usize

The alignment of pointer.
Source§

type Init = T

The type for initializers.
Source§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
Source§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
Source§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
Source§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<SS, SP> SupersetOf<SS> for SP
where SS: SubsetOf<SP>,

Source§

fn to_subset(&self) -> Option<SS>

The inverse inclusion map: attempts to construct self from the equivalent element of its superset. Read more
Source§

fn is_in_subset(&self) -> bool

Checks if self is actually part of its subset T (and can be converted to it).
Source§

fn to_subset_unchecked(&self) -> SS

Use with care! Same as self.to_subset but without any property checks. Always succeeds.
Source§

fn from_subset(element: &SS) -> SP

The inclusion map: converts self to the equivalent element of its superset.
Source§

impl<SS, SP> SupersetOf<SS> for SP
where SS: SubsetOf<SP>,

Source§

fn to_subset(&self) -> Option<SS>

The inverse inclusion map: attempts to construct self from the equivalent element of its superset. Read more
Source§

fn is_in_subset(&self) -> bool

Checks if self is actually part of its subset T (and can be converted to it).
Source§

unsafe fn to_subset_unchecked(&self) -> SS

Use with care! Same as self.to_subset but without any property checks. Always succeeds.
Source§

fn from_subset(element: &SS) -> SP

The inclusion map: converts self to the equivalent element of its superset.
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V