Crate scrunch

§scrunch

Scrunch provides compression that supports full-text search over the compressed data. This is a modified implementation of “High-Order Entropy-Compressed Text Indexes” by Grossi, Gupta, and Vitter.

§Status

Under active development. This library is incomplete and will change as the remaining pieces are completed.

§Scope

The crate aims to provide everything necessary to take a document, compress it, and serve queries over the compressed document.

§Warts

None known at the moment.

§Documentation

The latest documentation is always available at docs.rs.

Modules§

binary_search
bit_array
bit_vector
builder
encoder
isa
psi
sa
sais
Suffix Array-Induced Sort (sais, for short) is an algorithm that constructs the suffix array of a string in time linear in the length of the string. A suffix array SA lists the starting offsets of all suffixes of a string S in sorted order, so that S[SA[i]..] is the i’th suffix in sorted order.
sampled
sigma
Sigma is the Greek letter often used to represent an alphabet when dealing with formal languages. The sigma module provides a data structure for describing the alphabet used in a piece of text.
wavelet_tree
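
The suffix array and alphabet described for the sais and sigma modules can be illustrated with a minimal standalone sketch. It does not use the scrunch API: `naive_suffix_array` and `alphabet` are hypothetical helper names, and the construction shown is a plain comparison sort over suffixes, not the linear-time sais algorithm.

```rust
use std::collections::BTreeSet;

/// Build a suffix array by sorting suffix start offsets lexicographically.
/// This is a naive comparison sort (worst case O(n^2 log n)), shown only to
/// make the definition concrete. It is NOT the linear-time sais algorithm.
fn naive_suffix_array(text: &[u8]) -> Vec<usize> {
    let mut sa: Vec<usize> = (0..text.len()).collect();
    // Compare the suffixes text[a..] and text[b..] byte by byte.
    sa.sort_by(|&a, &b| text[a..].cmp(&text[b..]));
    sa
}

/// Collect the distinct symbols that occur in the text, in sorted order.
/// A hypothetical stand-in for "describing the alphabet used in a piece of text".
fn alphabet(text: &[u8]) -> Vec<u8> {
    text.iter()
        .copied()
        .collect::<BTreeSet<u8>>()
        .into_iter()
        .collect()
}

fn main() {
    let text = b"banana";
    let sa = naive_suffix_array(text);
    // text[sa[i]..] is the i'th suffix in sorted order.
    for (i, &start) in sa.iter().enumerate() {
        println!(
            "SA[{}] = {} -> {}",
            i,
            start,
            String::from_utf8_lossy(&text[start..])
        );
    }
    println!("alphabet = {:?}", alphabet(text));
}
```

For the text "banana", the sorted suffixes are "a", "ana", "anana", "banana", "na", "nana", so the suffix array is [5, 3, 1, 0, 4, 2].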

Structs§

Correlate
Exemplar
Exemplars
PsiDocument
RecordOffset
ReferenceDocument
TextOffset

Enums§

Error

Traits§

Document

Functions§

correlate
exemplars
exemplars_from_needle

Type Aliases§

CompressedDocument