flash_rerank 0.2.0

Core reranking engine — cross-encoder and ColBERT inference via ONNX Runtime
Documentation

Flash-Rerank -- Blazing-fast neural reranking engine.

Provides cross-encoder and ColBERT inference via ONNX Runtime, with support for the TensorRT, CUDA, and CPU execution providers.
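
To illustrate what ColBERT-style reranking computes, here is a minimal sketch of late-interaction (MaxSim) scoring in plain Rust. It is not taken from the flash_rerank API: the names `dot`, `maxsim_score`, and `rerank` are hypothetical, and the sketch assumes the per-token embeddings have already been produced by the model (for example, through an ONNX Runtime session).

```rust
/// Dot product of two equal-length embedding vectors.
fn dot(a: &[f32], b: &[f32]) -> f32 {
    a.iter().zip(b.iter()).map(|(x, y)| x * y).sum()
}

/// Hypothetical helper: ColBERT-style MaxSim score.
/// For each query token, take the best-matching document token
/// (maximum dot product), then sum those maxima over the query.
fn maxsim_score(query: &[Vec<f32>], doc: &[Vec<f32>]) -> f32 {
    if doc.is_empty() {
        // No document tokens to match against.
        return 0.0;
    }
    query
        .iter()
        .map(|q| {
            doc.iter()
                .map(|d| dot(q, d))
                .fold(f32::NEG_INFINITY, f32::max)
        })
        .sum()
}

/// Hypothetical helper: score every candidate document and return
/// (original index, score) pairs sorted from best to worst.
fn rerank(query: &[Vec<f32>], docs: &[Vec<Vec<f32>>]) -> Vec<(usize, f32)> {
    let mut scored: Vec<(usize, f32)> = docs
        .iter()
        .enumerate()
        .map(|(i, d)| (i, maxsim_score(query, d)))
        .collect();
    // Descending order by score; total_cmp gives a total order on f32.
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored
}
```

A cross-encoder differs in that it feeds the query and document through the model together and reads a single relevance score from the output, so no token-level matching step like the one above is needed.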