use-ml-evaluation
Evaluation run and validation metadata primitives for RustUse.
Experimental
use-ml-evaluation is experimental while use-ml remains below 0.3.0.
Example
use ;
let run_id = new?;
let threshold = new?;
let kind: MlEvaluationKind = "cross-validation".parse?;
assert_eq!;
assert_eq!;
assert_eq!;
# Ok::
Scope
- Evaluation run IDs, kinds, validation strategies, targets, statuses, slices, and benchmarks.
- Threshold metadata and confusion-matrix shape metadata.
- Generic ML evaluation metadata only.
Non-goals
- Computing evaluation metrics beyond trivial metadata validation.
- LLM-as-judge, prompt evaluation, conversation evaluation, hallucination checks, safety/guardrail evaluation, or retrieval-groundedness evaluation.
License
Licensed under either Apache-2.0 or MIT.