Expand description
ML-readiness evaluation module.
Validates that generated data is suitable for machine learning tasks including feature distributions, label quality, and graph structure.
Also provides baseline task definitions for benchmarking synthetic data.
Structs§
- Anomaly
Scoring Analysis - Results of anomaly scoring analysis.
- Anomaly
Scoring Analyzer - Analyzer for anomaly scoring quality.
- Anomaly
Scoring Thresholds - Thresholds for anomaly scoring analysis.
- Baseline
Config - Configuration for baseline evaluation.
- Baseline
Evaluation - Collection of baseline results for all tasks.
- Baseline
Result - Baseline task results.
- Baseline
Summary - Summary of baseline evaluation.
- Baseline
Task - ML baseline task definition.
- Classification
Metrics - Classification metrics for binary/multiclass tasks.
- Cross
Modal Analysis - Results of cross-modal consistency analysis.
- Cross
Modal Analyzer - Analyzer for cross-modal consistency.
- Cross
Modal Thresholds - Thresholds for cross-modal consistency analysis.
- Distribution
Sample - A pair of value distributions to compare.
- Domain
GapAnalysis - Results of domain gap analysis.
- Domain
GapAnalyzer - Analyzer for domain gap between synthetic and reference distributions.
- Domain
GapDetail - Detail for a single distribution comparison.
- Domain
GapThresholds - Thresholds for domain gap analysis.
- Embedding
Input - Input for embedding readiness analysis.
- Embedding
Readiness Analysis - Results of embedding readiness analysis.
- Embedding
Readiness Analyzer - Analyzer for embedding readiness.
- Embedding
Readiness Thresholds - Thresholds for embedding readiness analysis.
- Entity
Modal Data - Modal data for a single entity with tabular and graph feature vectors.
- Expected
Metrics - Expected performance metrics for a task.
- Feature
Analysis - Results of feature analysis.
- Feature
Analyzer - Analyzer for feature distributions.
- Feature
Quality Analysis - Results of feature quality analysis.
- Feature
Quality Analyzer - Analyzer for feature quality metrics.
- Feature
Quality Thresholds - Thresholds for feature quality analysis.
- Feature
Stats - Statistics for a single feature.
- Feature
Vector - A single feature vector with optional label values for importance estimation.
- GnnGraph
Data - Input graph data for GNN readiness analysis.
- GnnReadiness
Analysis - Results of GNN readiness analysis.
- GnnReadiness
Analyzer - Analyzer for GNN readiness.
- GnnReadiness
Thresholds - Thresholds for GNN readiness analysis.
- Graph
Analysis - Results of graph analysis.
- Graph
Analyzer - Analyzer for graph structure.
- Graph
Metrics - Basic graph metrics.
- Label
Analysis - Results of label analysis.
- Label
Analyzer - Analyzer for label quality.
- Label
Distribution - Distribution for a single label class.
- MLReadiness
Evaluation - Combined ML-readiness evaluation results.
- Ranking
Metrics - Ranking metrics for link prediction and recommendation.
- Regression
Metrics - Regression metrics for continuous prediction tasks.
- Scheme
Detectability Analysis - Results of scheme detectability analysis.
- Scheme
Detectability Analyzer - Analyzer for scheme detectability.
- Scheme
Detectability Thresholds - Thresholds for scheme detectability analysis.
- Scheme
Record - A single scheme record with difficulty and detection score.
- Scored
Record - A single record with an anomaly score and ground truth label.
- Split
Analysis - Results of split analysis.
- Split
Analyzer - Analyzer for train/test splits.
- Split
Metrics - Metrics for a single split.
- Temporal
Fidelity Analysis - Results of temporal fidelity analysis.
- Temporal
Fidelity Analyzer - Analyzer for temporal fidelity.
- Temporal
Fidelity Thresholds - Thresholds for temporal fidelity analysis.
- Temporal
Record - A single temporal record with a timestamp and associated value.
Enums§
- Baseline
Algorithm - Baseline algorithm for a task.
- MLTask
Type - ML task type for benchmarking.
- Performance
Grade - Performance grade for baseline results.
Functions§
- get_
accounting_ baseline_ tasks - Get predefined baseline tasks for synthetic accounting data.