nous-judge 0.3.0

Async LLM-as-judge evaluators for Nous — plan quality, adherence, task completion
Documentation
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
//! Async LLM-as-judge evaluators for Nous.
//!
//! These evaluators run asynchronously after agent runs complete.
//! They use a separate model call to assess quality dimensions
//! that require language understanding.

pub mod anthropic_judge;
pub mod judge_provider;
pub mod plan_adherence;
pub mod plan_quality;
pub mod task_completion;

pub use anthropic_judge::AnthropicJudgeProvider;
pub use judge_provider::{JudgeProvider, MockJudgeProvider, parse_judge_scores};
pub use plan_adherence::PlanAdherence;
pub use plan_quality::PlanQuality;
pub use task_completion::TaskCompletion;