Skip to main content

Module metrics

Module metrics 

Source
Expand description

Compute IR metrics from pre-judged results (no API calls needed)

Reads judgments JSONL (produced by Claude Code or external judge) and retrieval results JSONL, computes standard IR metrics.

Functionsยง

compute_metrics_from_judgments
Compute metrics from pre-judged results
format_metrics_summary
Format metrics summary to stdout