Skip to main content

Crate rig_retrieval_evals

Crate rig_retrieval_evals 

Source
Expand description

§rig-retrieval-evals

Retrieval and knowledge-base evaluation harness for Rig agents.

The crate gives you:

See the crate README for an end-to-end quickstart.

§Stability

The default build ships retrieval-quality evaluation only. Optional features add RAGAS-style judges, zero-waste ingestion checks, knowledge-gain scoring, and optional embedding novelty adapters.

Re-exports§

pub use dataset::GoldQuery;
pub use dataset::Qrels;
pub use dataset::RetrievedDoc;
pub use dataset::RetrievedSet;
pub use error::Error;
pub use error::Result;
pub use harness::RetrievalHarness;
pub use report::FreshnessQueryRollup;
pub use report::FreshnessReport;
pub use report::MetricCi;
pub use report::MetricDelta;
pub use report::MetricReport;
pub use report::MultiReport;
pub use report::QueryDelta;
pub use report::QueryReliability;
pub use report::RegressionGate;
pub use report::ReliabilityReport;
pub use report::ReportDiff;
pub use retrieval::HitRateAtK;
pub use retrieval::MapAtK;
pub use retrieval::Mrr;
pub use retrieval::NdcgAtK;
pub use retrieval::PrecisionAtK;
pub use retrieval::RecallAtK;
pub use retrieval::RetrievalMetric;
pub use retriever::Retriever;
pub use retriever::VectorStoreRetriever;
pub use retriever::retrieve_all;
pub use retriever::score_retriever;
pub use staleness::ConflictGroup;
pub use staleness::ConflictReport;
pub use staleness::CorpusVersions;
pub use staleness::StaleHit;
pub use staleness::StalenessAnnotation;
pub use staleness::StalenessReport;
pub use staleness::detect_conflicts;
pub use staleness::detect_stale_hits;
pub use synthetic::SyntheticConfig;
pub use synthetic::SyntheticCorpus;
pub use synthetic::SyntheticDoc;
pub use synthetic::generate as generate_synthetic;

Modules§

dataset
Labeled retrieval datasets (qrels) and accompanying corpus / answer files.
error
Error types for rig-retrieval-evals.
harness
Async driver that executes a Qrels against a VectorStoreIndexDyn and aggregates per-query metric scores.
report
Aggregation, serialization, and baseline diffing of per-query metric scores produced by crate::harness::RetrievalHarness.
retrieval
Retrieval-quality metrics.
retriever
A retriever abstraction over anything that maps a query to a ranked list of documents — not just vector stores.
staleness
Stale-content and conflict detection for retrieval results.
synthetic
Deterministic synthetic “needle in a haystack” dataset generation.