Evaluation result reporting
Structures for representing and formatting evaluation results.
Structs
- EvaluationReport - Complete evaluation report for a test file or eval set
- EvaluationResult - Result for a single test case
- EvaluationSummary - Summary statistics for an evaluation run
- Failure - A single failure in evaluation
- TurnResult - Result for a single conversation turn
Type Aliases
- TestCaseResult - Result for a single test case (alias for backward compatibility)
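The listing above does not show field definitions, so the following is only a minimal sketch of how a backward-compatibility alias and a summary type can fit together. It assumes that `TestCaseResult` aliases `EvaluationResult` (both are described as the result for a single test case). The fields `name`, `passed`, `total`, and `failed`, and the helper `summarize`, are hypothetical and are not taken from the crate.

```rust
// Hypothetical shapes: every field name here is an assumption for illustration,
// not the crate's actual definition.

/// Result for a single test case (illustrative fields).
#[derive(Debug, Clone, PartialEq)]
pub struct EvaluationResult {
    pub name: String,
    pub passed: bool,
}

/// Older name kept as a type alias so existing callers keep compiling.
/// Assumed target of the alias; the listing does not state it explicitly.
pub type TestCaseResult = EvaluationResult;

/// Summary statistics for an evaluation run (illustrative fields).
#[derive(Debug, Clone, PartialEq)]
pub struct EvaluationSummary {
    pub total: usize,
    pub passed: usize,
    pub failed: usize,
}

/// Hypothetical helper: count passed and failed results into a summary.
pub fn summarize(results: &[EvaluationResult]) -> EvaluationSummary {
    let passed = results.iter().filter(|r| r.passed).count();
    EvaluationSummary {
        total: results.len(),
        passed,
        failed: results.len() - passed,
    }
}
```

Because a type alias names the same type, values built through `TestCaseResult { .. }` and `EvaluationResult { .. }` can be stored in the same slice and passed to the same functions without conversion.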