use-ml-evaluation 0.0.1

Evaluation run and validation metadata primitives for RustUse.
Documentation
  • Coverage
  • 1.32%
    1 out of 76 items documented1 out of 29 items with examples
  • Size
  • Source code size: 13.08 kB This is the summed size of all the files inside the crates.io package for this release.
  • Documentation size: 1.25 MB This is the summed size of all files generated by rustdoc for all configured targets
  • Ø build duration
  • this release: 3s Average build duration of successful builds.
  • all releases: 3s Average build duration of successful builds in releases after 2024-10-23.
  • Links
  • Homepage
  • RustUse/use-ml
    1 0 0
  • crates.io
  • Dependencies
  • Versions
  • Owners
  • CloudBranch

use-ml-evaluation

Evaluation run and validation metadata primitives for RustUse.

Experimental

use-ml-evaluation is experimental while use-ml remains below 0.3.0.

Example

use use_ml_evaluation::{MlEvaluationKind, MlEvaluationRunId, MlThreshold};

let run_id = MlEvaluationRunId::new("eval-001")?;
let threshold = MlThreshold::new(0.5)?;
let kind: MlEvaluationKind = "cross-validation".parse()?;

assert_eq!(run_id.as_str(), "eval-001");
assert_eq!(threshold.value(), 0.5);
assert_eq!(kind, MlEvaluationKind::CrossValidation);
# Ok::<(), use_ml_evaluation::MlEvaluationError>(())

Scope

  • Evaluation run IDs, kinds, validation strategies, targets, statuses, slices, and benchmarks.
  • Threshold metadata and confusion-matrix shape metadata.
  • Generic ML evaluation metadata only.

Non-goals

  • Computing evaluation metrics beyond trivial metadata validation.
  • LLM-as-judge, prompt evaluation, conversation evaluation, hallucination checks, safety/guardrail evaluation, or retrieval-groundedness evaluation.

License

Licensed under either Apache-2.0 or MIT.