pub struct EvaluationReport {
pub evaluation_id: EvaluationId,
pub scope: EvaluationScope,
pub comparison: ComparisonDesign,
pub verdict: EvaluationVerdict,
pub score: Option<String>,
pub confidence: EvaluationConfidence,
pub judgments: Vec<EvaluatorJudgment>,
pub metric_deltas: Vec<EvaluationMetricDelta>,
pub evidence_refs: Vec<EntityRef>,
pub usage: EvaluationUsage,
pub redacted_summary: String,
pub limitations: Vec<String>,
}Expand description
Top-level report returned by an evaluator.
Fields§
§evaluation_id: EvaluationIdStable evaluation id.
scope: EvaluationScopeScope this report evaluates.
comparison: ComparisonDesignComparison design actually used.
verdict: EvaluationVerdictTop-level verdict.
score: Option<String>Optional top-level score.
confidence: EvaluationConfidenceTop-level confidence.
judgments: Vec<EvaluatorJudgment>Per-subject or per-criterion judgments.
metric_deltas: Vec<EvaluationMetricDelta>Metric deltas for measured evaluations.
evidence_refs: Vec<EntityRef>Evidence refs used by this report.
usage: EvaluationUsageUsage captured during evaluation.
redacted_summary: StringBounded report summary.
limitations: Vec<String>Limitations or validation notes.
Implementations§
Source§impl EvaluationReport
impl EvaluationReport
Sourcepub fn new(
evaluation_id: EvaluationId,
scope: EvaluationScope,
comparison: ComparisonDesign,
verdict: EvaluationVerdict,
confidence: EvaluationConfidence,
redacted_summary: impl Into<String>,
) -> Self
pub fn new( evaluation_id: EvaluationId, scope: EvaluationScope, comparison: ComparisonDesign, verdict: EvaluationVerdict, confidence: EvaluationConfidence, redacted_summary: impl Into<String>, ) -> Self
Creates a report with no metric deltas.
Sourcepub fn with_usage(self, usage: EvaluationUsage) -> Self
pub fn with_usage(self, usage: EvaluationUsage) -> Self
Returns this report with usage attached.
Sourcepub fn with_judgment(self, judgment: EvaluatorJudgment) -> Self
pub fn with_judgment(self, judgment: EvaluatorJudgment) -> Self
Returns this report with one judgment appended.
Sourcepub fn with_metric_delta(self, metric_delta: EvaluationMetricDelta) -> Self
pub fn with_metric_delta(self, metric_delta: EvaluationMetricDelta) -> Self
Returns this report with one metric delta appended.
Sourcepub fn validate_confidence_contract(&self) -> Result<(), AgentError>
pub fn validate_confidence_contract(&self) -> Result<(), AgentError>
Validates that measured confidence is backed by comparison evidence and metric deltas.
Sourcepub fn validate_confidence_contract_for_request(
&self,
request: &EvaluationRequest,
) -> Result<(), AgentError>
pub fn validate_confidence_contract_for_request( &self, request: &EvaluationRequest, ) -> Result<(), AgentError>
Validates measured confidence against request-owned metric deltas.
Trait Implementations§
Source§impl Clone for EvaluationReport
impl Clone for EvaluationReport
Source§fn clone(&self) -> EvaluationReport
fn clone(&self) -> EvaluationReport
Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§fn clone_from(&mut self, source: &Self)
fn clone_from(&mut self, source: &Self)
Performs copy-assignment from
source. Read moreSource§impl Debug for EvaluationReport
impl Debug for EvaluationReport
Source§impl<'de> Deserialize<'de> for EvaluationReport
impl<'de> Deserialize<'de> for EvaluationReport
Source§fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
fn deserialize<__D>(__deserializer: __D) -> Result<Self, __D::Error>where
__D: Deserializer<'de>,
Deserialize this value from the given Serde deserializer. Read more
Source§impl PartialEq for EvaluationReport
impl PartialEq for EvaluationReport
Source§fn eq(&self, other: &EvaluationReport) -> bool
fn eq(&self, other: &EvaluationReport) -> bool
Tests for
self and other values to be equal, and is used by ==.Source§impl Serialize for EvaluationReport
impl Serialize for EvaluationReport
impl Eq for EvaluationReport
impl StructuralPartialEq for EvaluationReport
Auto Trait Implementations§
impl Freeze for EvaluationReport
impl RefUnwindSafe for EvaluationReport
impl Send for EvaluationReport
impl Sync for EvaluationReport
impl Unpin for EvaluationReport
impl UnsafeUnpin for EvaluationReport
impl UnwindSafe for EvaluationReport
Blanket Implementations§
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more