Expand description
Data preprocessing utilities for scientific computing
This module provides common data preprocessing operations used across the SciRS2 ecosystem, including scaling, encoding, imputation, and outlier detection.
§Scalers
StandardScaler- Standardize features by removing the mean and scaling to unit varianceMinMaxScaler- Scale features to a given range (default [0, 1])RobustScaler- Scale features using statistics robust to outliers (median, IQR)MaxAbsScaler- Scale each feature by its maximum absolute value
§Encoders
LabelEncoder- Encode string labels as integersOneHotEncoder- Encode categorical features as one-hot numeric arraysOrdinalEncoder- Encode categorical features as ordinal integers
§Imputation
Imputer- Fill missing values using various strategies
§Outlier Detection
OutlierDetector- Detect outliers using Z-score or IQR methods
Structs§
- Imputer
- Impute missing values in numeric data.
- Label
Encoder - Encode string (or any hashable) labels as integer indices.
- MaxAbs
Scaler - Scale each feature by its maximum absolute value so values are in [-1, 1].
- MinMax
Scaler - Scale features to a given range [feature_min, feature_max] (default [0, 1]).
- OneHot
Encoder - One-hot encode categorical features.
- Ordinal
Encoder - Encode categorical features as ordinal integers.
- Outlier
Detector - Detect outliers in numeric data.
- Robust
Scaler - Scale features using statistics robust to outliers.
- Standard
Scaler - Standardize features by removing the mean and scaling to unit variance.
Enums§
- Impute
Strategy - Strategy for imputing missing values.
- Outlier
Method - Method used for outlier detection.