Expand description
Data splitting utilities for machine learning workflows
This module provides tools for splitting datasets into training, validation, and test sets using various strategies:
train_test_split- Simple random train/test splitstratified_train_test_split- Stratified split preserving class proportionsKFold- K-fold cross-validationStratifiedKFold- Stratified K-fold cross-validationLeaveOneOut- Leave-one-out cross-validationTimeSeriesSplit- Time series cross-validation (expanding or sliding window)GroupKFold- Group K-fold (keeps groups intact)ShuffleSplit- Repeated random train/test splits
Structs§
- GroupK
Fold - Group K-fold cross-validation.
- KFold
- K-fold cross-validation splitter.
- Leave
OneOut - Leave-one-out cross-validation.
- Shuffle
Split - Repeated random train/test splits.
- StratifiedK
Fold - Stratified K-fold cross-validation.
- Time
Series Split - Time series cross-validation splitter.
Enums§
- Time
Series Mode - Time series split mode.
Functions§
- stratified_
train_ test_ split - Stratified train/test split that preserves the proportion of each class.
- train_
test_ split - Split data indices into training and test sets.
Type Aliases§
- Split
Indices - Indices for a single split (train indices, test indices).