nous-judge 0.3.0

Async LLM-as-judge evaluators for Nous — plan quality, adherence, task completion
Documentation

Async LLM-as-judge evaluators for Nous.

These evaluators run asynchronously after agent runs complete. They use a separate model call to assess quality dimensions that require language understanding.