Flash-Rerank -- Blazing-fast neural reranking engine.
Provides cross-encoder and ColBERT inference via ONNX Runtime with TensorRT, CUDA, and CPU execution providers.
Flash-Rerank -- Blazing-fast neural reranking engine.
Provides cross-encoder and ColBERT inference via ONNX Runtime with TensorRT, CUDA, and CPU execution providers.