Skip to main content

Module eval

Module eval 

Source
Expand description

Downstream task evaluation framework for search quality.

Measures how well the search pipeline supports actual coding tasks:

  • Retrieval precision/recall against known-relevant chunks
  • Mean Reciprocal Rank (MRR) for expected top results
  • Normalized Discounted Cumulative Gain (nDCG)

Designed to compare BM25-only vs hybrid search and track quality over time.

Structs§

EvalQuery
A single evaluation query with expected relevant results.
EvalReport
Result of evaluating a search system against a query set.
QueryScore
RetrievedItem
Retrieved result for evaluation (file path + score).

Functions§

evaluate
Evaluate search results against a set of queries with known relevance.