rust-brotli-decompressor
What's new in version 2.2.x
- into_impl for reader and writer classes
- removed BrotliStateCleanup since it happens upon drop()
What's new in version 2.1.2
- Better handling of transient errors ( fixes #4 )
- Do not panic in debug mode on unexpected bytes (handle arithmetic overflow)
- Smaller stack allocations
- Never create slice::from_raw_parts with nil
- Better panic reporting to C FFI
- Backport fixes to brotli issues 502 and 506
What's new in version 2.0.0
- Legacy Custom dictionaries (mostly useful for testing multithreaded brotli encoding and experimentation)
- New mechanism to request that the library should only depend on custom allocations: the --no-default-features flag since --features=std is on by default.
- Fully compatible C FFI to match the https://github.com/google/brotli C API and become a drop-in replacement
Project Requirements
Direct no-stdlib port of the C brotli decompressor to Rust if --no-default-features is passed into the build
no dependency on the Rust stdlib: this library would be ideal for decompressing within a rust kernel among other things.
This will be useful to see how C and Rust compare in an apples-to-apples comparison where the same algorithms and data structures and optimizations are employed.
The current expected performance losses come from
- an extra indirection in the hgroups
- array bounds checks on every access
- no ability to load a full aligned 64 bit or 128 bit item from a [u8]
the system also enables all syscalls to be "frontloaded" in the initial generation of a memory pool for the allocator. Afterwards, SECCOMP can be activated or other mechanisms can be used to secure the application, if desired
Linking rust-brotli-decompressor with C code using the zero-cost rust FFI abstraction
This library has FFI exports which comply with the original C interfaces. To build them, enter the c directory and just type make there. That will build a small example program and the cdylib with the appropriate ffi in place to link against
the example, called c/main.c shows how to decompress a program using the streaming interface and the nonstreaming interface.
If a nostdlib version is desired, then an unstable rust must be used (to enable the custom panic handler) and then the BrotliDecoderDecompress function is deactivated since that has no facilities for specifying a custom malloc
a customized malloc must be used if a nostdlib build is chosen and additionally the no-stdlib-ffi-binding cargo feature must be set eg
cargo build --features='no-stdlib no-stdlib-ffi-binding' --release
Usage
With the io::Read abstraction
let mut input = new;
then you can simply read input as you would any other io::Read class
With the Stream Copy abstraction
match BrotliDecompress
With manual memory management
There are 3 steps to using brotli without stdlib
- setup the memory manager
- setup the BrotliState
- in a loop, call BrotliDecompressStream
in Detail
// at global scope declare a MemPool type -- in this case we'll choose the heap to
// avoid unsafe code, and avoid restrictions of the stack size
declare_stack_allocator_struct!;
// at local scope, make a heap allocated buffers to hold uint8's uint32's and huffman codes
let mut u8_buffer = define_allocator_memory_pool!;
let mut u32_buffer = define_allocator_memory_pool!;
let mut hc_buffer = define_allocator_memory_pool!;
let heap_u8_allocator = new_allocator;
let heap_u32_allocator = new_allocator;
let heap_hc_allocator = new_allocator;
// At this point no more syscalls are going to be needed since everything can come from the allocators.
// Feel free to activate SECCOMP jailing or other mechanisms to secure your application if you wish.
// Now it's possible to setup the decompressor state
let mut brotli_state = new;
// at this point the decompressor simply needs an input and output buffer and the ability to track
// the available data left in each buffer
loop
This interface is the same interface that the C brotli decompressor uses
Also feel free to use custom allocators that invoke Box directly. This example illustrates a mechanism to avoid subsequent syscalls after the initial allocation