Expand description
Library supporting converting CIFF to PISA uncompressed collection format.
Refer to osirrc/ciff on Github
for more detailed information about the format.
For more information about PISA’s internal storage formats, see the documentation.
§Examples
Use PisaToCiff and CiffToPisa builders to convert from one format
to another.
CiffToPisa::default()
.input_path(ciff_file)
.output_paths(&pisa_base_path)
.convert()?;
PisaToCiff::default()
.description("Hello, CIFF!")
.pisa_paths(&pisa_base_path)
.output_path(output)
.convert()?;Structs§
- Binary
Collection - Represents a single binary collection.
- Binary
Sequence - A single binary sequence.
- Ciff
ToPisa - CIFF to PISA converter.
- DocRecord
- Invalid
Format - Error raised when the bytes cannot be properly parsed into the collection format.
- Payload
Iter - Iterator over
PayloadSlice. - Payload
Slice - Payload slice is a slice of variable-sized elements (payloads) encoded in a single block of memory. This way, sequences of, say, strings, can be indexed into without loading all the elements in memory, but rather using a memory mapped buffer.
- Payload
Vector - Owning variant of
PayloadSlice, in which the underlying bytes are fully in memory within the struct. This is useful mainly for building the structure before writing it to a file, but also if one decides to fully load the bytes to memory and use it to assess elements without parsing the whole vector to aVec. - Pisa
ToCiff - PISA to CIFF converter.
- Posting
- Postings
List - Random
Access Binary Collection - A version of
BinaryCollectionwith random access to sequences.
Functions§
- build_
lexicon - Builds a lexicon using the text file at
inputand writes it tooutput. - ciff_
to_ pisa Deprecated - Converts a CIFF index stored in
pathto a PISA “binary collection” (uncompressed inverted index) with a basenameoutput. - concat
- Concatenate two
OsStrings. - encode_
u32_ sequence - Encodes a sequence of 4-byte unsigned integers into
writerin native-endianness. - pisa_
to_ ciff Deprecated - Converts a a PISA “binary collection” (uncompressed inverted index) with a basename
inputto a CIFF index stored inoutput.