Skip to main content

Module preprocess

Module preprocess 

Source
Expand description

Data preprocessing transformers.

Provides scalers, encoders, dimensionality reduction, and a composable Transformer trait for building preprocessing pipelines.

Structs§

ColumnTransformer
Apply different transformers to different column subsets, then concatenate all transformed outputs.
LabelEncoder
Encode string labels as integer indices.
MinMaxScaler
Scale features to a [0, 1] range.
Normalizer
Normalize samples individually to unit norm.
OneHotEncoder
One-hot encoder for integer-encoded categorical features.
Pca
Principal Component Analysis.
PolynomialFeatures
Generate polynomial and interaction features.
RobustScaler
Scale features using the median and inter-quartile range (IQR).
SimpleImputer
Imputes missing (NaN) values in each feature column.
StandardScaler
Standardize features by removing the mean and scaling to unit variance.

Enums§

DropStrategy
Strategy for dropping one-hot columns to avoid multicollinearity.
Norm
Norm type for row-wise normalization.
Strategy
Strategy for computing the replacement value per feature.
UnknownStrategy
Strategy for handling categories seen at transform time but not at fit time.

Traits§

Transformer
A data transformer that can be fitted on a dataset and applied to transform it.