libsais-rs
An idiomatic and mostly safe API wrapper for the awesome and very fast library libsais by Ilya Grebnov.
⚠️ Warning: this crate is not yet battle-tested and may contain bugs. The API is still subject to small changes. Feedback and suggestions via the issue tracker are highly appreciated! ⚠️
Functionality
This crate exposes the whole functionality of libsais. It might be useful to also check out the documentation of the original library.
- Suffix array construction for u8/u16/i32/i64 texts and i32/i64 output arrays
- Generalized suffix arrays
- Longest common prefix arrays and permuted longest common prefix arrays
- Burrows-Wheeler-Transform and reversal
- Optional multithreading support via the openmp feature (enabled by default)
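For readers new to these data structures, the following is a minimal sketch of what a suffix array and a longest common prefix (LCP) array contain. It uses only the Rust standard library and a deliberately naive algorithm. The function names `naive_suffix_array` and `naive_lcp_array` are hypothetical and are not part of this crate or of libsais, and the LCP convention shown (first entry is 0) is only one common choice that may differ from the one libsais uses.

```rust
/// Naive suffix array: sort all suffix start positions by the suffix
/// that begins there. Far slower than libsais; for illustration only.
fn naive_suffix_array(text: &[u8]) -> Vec<usize> {
    let mut sa: Vec<usize> = (0..text.len()).collect();
    sa.sort_by(|&a, &b| text[a..].cmp(&text[b..]));
    sa
}

/// Naive LCP array (assumed convention): lcp[i] is the length of the
/// longest common prefix of the suffixes starting at sa[i - 1] and sa[i],
/// and lcp[0] is 0.
fn naive_lcp_array(text: &[u8], sa: &[usize]) -> Vec<usize> {
    let mut lcp = vec![0; sa.len()];
    for i in 1..sa.len() {
        let previous = &text[sa[i - 1]..];
        let current = &text[sa[i]..];
        lcp[i] = previous
            .iter()
            .zip(current.iter())
            .take_while(|(x, y)| x == y)
            .count();
    }
    lcp
}
```

For the text `banana`, the sorted suffixes are `a`, `ana`, `anana`, `banana`, `na`, `nana`, so the sketch yields the suffix array `[5, 3, 1, 0, 4, 2]` and the LCP array `[0, 1, 3, 0, 0, 2]`.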
Usage
This crate provides generic builder-like APIs for the whole functionality listed above. The following is a simple example of how to use this library to construct a suffix array in parallel:
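use libsais::{SuffixArrayConstruction, ThreadCount};

let text = b"barnabasbabblesaboutbananas";

// Build the suffix array in a newly allocated buffer,
// using the default OpenMP thread count.
let suffix_array: Vec<i32> = SuffixArrayConstruction::for_text(text)
    .in_owned_buffer()
    .multi_threaded(ThreadCount::openmp_default())
    .run()
    .expect("suffix array construction should succeed")
    .into_vec();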
Please consult the documentation and the examples for more details on how to use this library.
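When experimenting with the builder API, it can help to sanity-check a result on small inputs. The following is a minimal sketch of such a check, assuming the suffix array is available as a slice of `i32` values. The helper `is_valid_suffix_array` is hypothetical and not part of this crate, and it runs in quadratic time in the worst case, so it is only suitable for small test inputs.

```rust
/// Returns true if `sa` is a permutation of 0..text.len() whose suffixes
/// appear in strictly increasing lexicographic order.
fn is_valid_suffix_array(text: &[u8], sa: &[i32]) -> bool {
    if sa.len() != text.len() {
        return false;
    }
    // Every position must occur exactly once and be in bounds.
    let mut seen = vec![false; text.len()];
    for &position in sa {
        if position < 0 {
            return false;
        }
        let position = position as usize;
        if position >= text.len() || seen[position] {
            return false;
        }
        seen[position] = true;
    }
    // Neighbouring suffixes must be strictly ordered; all suffixes of a
    // text differ in length, so no two can be equal.
    sa.windows(2)
        .all(|pair| &text[pair[0] as usize..] < &text[pair[1] as usize..])
}
```

For example, `is_valid_suffix_array(b"banana", &[5, 3, 1, 0, 4, 2])` returns `true`, while the identity permutation `[0, 1, 2, 3, 4, 5]` is rejected.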
Performance and benchmarks
This library only adds a few safety checks, which should not noticeably affect performance. A notable exception is suffix array construction for i32 and i64 inputs. Please consult the documentation for details.
Below are the results of a small benchmark of suffix array construction algorithms available on crates.io. The input was the human genome, truncated to 2 GB. Details about this benchmark can be found here.
The excellent performance of libsais is one of the main reasons why this API wrapper crate was created.