Skip to main content

Module benchmarks

Module benchmarks

Expand description

Benchmark suite definitions for ML evaluation.

Provides standardized benchmark datasets for:

Anomaly detection (AnomalyBench-1K)
Fraud detection (FraudDetect-10K)
Data quality detection (DataQuality-100K)
Entity matching (EntityMatch-5K)
ACFE-calibrated fraud detection (ACFE-Calibrated-1K, ACFE-Collusion-5K)
Industry-specific fraud detection (Manufacturing, Retail, Healthcare, Technology, Financial Services)

Each benchmark defines:

Dataset size and composition
Ground truth labels
Evaluation metrics
Expected baseline performance

Re-exports§

pub use acfe::acfe_calibrated_1k;
pub use acfe::acfe_collusion_5k;
pub use acfe::acfe_management_override_2k;
pub use acfe::all_acfe_benchmarks;
pub use acfe::AcfeAlignment;
pub use acfe::AcfeCalibration;
pub use acfe::AcfeCategoryDistribution;
pub use industry::all_industry_benchmarks;
pub use industry::financial_services_fraud_5k;
pub use industry::get_industry_benchmark;
pub use industry::healthcare_fraud_5k;
pub use industry::manufacturing_fraud_5k;
pub use industry::retail_fraud_10k;
pub use industry::technology_fraud_3k;
pub use industry::IndustryBenchmarkAnalysis;

Modules§

acfe: ACFE-aligned fraud evaluation benchmarks.
industry: Industry-specific evaluation benchmarks.

Structs§

BaselineResult: Expected baseline result for a benchmark.
BenchmarkBuilder: Builder for creating benchmark suites.
BenchmarkSuite: A benchmark suite definition.
CostMatrix: Cost matrix for cost-sensitive evaluation.
DatasetSpec: Dataset specification.
EvaluationSpec: Evaluation specification.
FeatureSet: Feature set for the benchmark.
LeaderboardEntry: Leaderboard entry for benchmark results.
SplitRatios: Train/validation/test split ratios.

Enums§

BaselineModelType: Types of baseline models.
BenchmarkTaskType: Types of benchmark tasks.
MetricType: Types of evaluation metrics.

Functions§

all_benchmarks: Get all available benchmark suites.
anomaly_bench_1k: AnomalyBench-1K: 1000 transactions with known anomalies.
data_quality_100k: DataQuality-100K: 100K records for data quality detection.
entity_match_5k: EntityMatch-5K: 5K records for entity matching.
fraud_detect_10k: FraudDetect-10K: 10K transactions for fraud detection.
get_benchmark: Get a benchmark by ID.
graph_fraud_10k: GraphFraud-10K: 10K transactions with network structure.