Module types


Core types for the evaluation framework

Structs

AggregateMetrics
Aggregate metrics across queries
ChunkJudgment
Judgment for a single retrieved chunk
EvalConfig
Configuration for eval operations
EvalOutput
Eval output with full results
EvalRunConfig
Config recorded in eval output
GroundTruthEntry
A single ground truth entry (query paired with its source chunk)
JudgeCache
Persistent cache for LLM judge verdicts
JudgeCacheEntry
Single cache entry
JudgeVerdict
LLM judge verdict for a (query, chunk) pair
JudgmentEntry
A single judgment entry (written by Claude Code or an external judge)
QueryResult
Per-query result with judge details
RetrievalResultEntry
Raw retrieval results for a single query
RetrievedChunk
A single retrieved chunk from a query
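
The listing above describes `JudgeCache` as a persistent cache of verdicts for (query, chunk) pairs. As a rough illustration of that idea only, here is a minimal in-memory sketch. The field names, the `get_or_judge` method, and the use of a string chunk id as part of the key are all assumptions made for this example; this page does not show the real definitions, and the real cache is persistent rather than in-memory.

```rust
use std::collections::HashMap;

/// Hypothetical shape: the real `JudgeVerdict` fields are not shown on this page.
#[derive(Debug, Clone, PartialEq)]
struct JudgeVerdict {
    relevant: bool,
}

/// Hypothetical in-memory stand-in for `JudgeCache`,
/// keyed by a (query, chunk id) pair.
#[derive(Default)]
struct JudgeCache {
    entries: HashMap<(String, String), JudgeVerdict>,
}

impl JudgeCache {
    /// Return the cached verdict for this pair, or call `judge` once
    /// and store its result (assumed method, for illustration only).
    fn get_or_judge<F>(&mut self, query: &str, chunk_id: &str, judge: F) -> JudgeVerdict
    where
        F: FnOnce() -> JudgeVerdict,
    {
        self.entries
            .entry((query.to_string(), chunk_id.to_string()))
            .or_insert_with(judge)
            .clone()
    }

    /// Number of distinct (query, chunk) pairs judged so far.
    fn len(&self) -> usize {
        self.entries.len()
    }
}

fn main() {
    let mut cache = JudgeCache::default();
    let mut calls = 0;

    // Ask for the same pair three times; the stub judge should run only once.
    for _ in 0..3 {
        let verdict = cache.get_or_judge("what is rust?", "chunk-1", || {
            calls += 1;
            JudgeVerdict { relevant: true }
        });
        assert!(verdict.relevant);
    }

    println!("judge calls: {calls}, cached entries: {}", cache.len());
}
```

Keying on the pair rather than on the query alone means a chunk that is retrieved again for the same query reuses its earlier verdict, which is the main saving when each verdict costs an LLM call.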