Expand description
Evaluation Module for RAG++
Provides metrics and utilities for evaluating retrieval quality.
§Metrics
- Recall@K: Fraction of relevant items retrieved in top-K
- Precision@K: Fraction of retrieved items that are relevant
- MRR (Mean Reciprocal Rank): Average of reciprocal ranks of first relevant item
- NDCG@K: Normalized Discounted Cumulative Gain
- Hit Rate@K: Whether at least one relevant item is in top-K
Structs§
- Benchmark
Result - Result of a benchmark run.
- Benchmarker
- Benchmark runner for performance evaluation.
- Evaluation
Summary - Aggregated evaluation metrics across multiple queries.
- Evaluator
- Evaluator for computing retrieval metrics.
- Query
Evaluation - Evaluation result for a single query.