ciff 0.1.1

The inverted index exchange format as defined as part of the Open-Source IR Replicability Challenge (OSIRRC) initiative
Documentation

Common Index File Format (CIFF)

Rust License crates.io

What is CIFF?

Common Index File Format CIFF is an inverted index exchange format as defined as part of the Open-Source IR Replicability Challenge (OSIRRC) initiative. The primary idea is to allow indexes to be dumped from Lucene via Anserini which can then be ingested by other search engines. This repository contains the necessary code to read the CIFF into a format which PISA can use for building (and then searching) indexes.

Versions

We currently provide a Rust binary for converting CIFF data to a PISA canonical index.

Build

Just run cargo build --release to build the binary. It can then be executed: ./target/release/common-index-format