mnemonist-quant 0.4.3

TurboQuant vector quantization for mnemonist — near-optimal MSE and inner-product quantizers

Repository: urmzd/mnemonist
Version: 0.4.3 (2026-04-04)


Implements the algorithms from TurboQuant (arXiv:2504.19874):

- TurboQuantMse: MSE-optimal quantizer using a random rotation followed by Lloyd-Max codebooks
- TurboQuantProd: unbiased inner-product quantizer (MSE quantizer plus a QJL stage on the residual)
- CompressedEmbeddingStore: binary storage format for quantized embeddings
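
To make the "random rotation + Lloyd-Max codebook" idea concrete, here is a minimal standalone sketch. It is not the crate's implementation: every function name is hypothetical, a randomized Hadamard transform (random signs followed by a Walsh-Hadamard transform) is swapped in for a dense random rotation, and the codebook is the well-known 2-bit Lloyd-Max quantizer for a standard normal variable. The input is assumed to have unit norm and a power-of-two length.

```rust
// Illustrative sketch only: none of these names come from mnemonist-quant.

/// 2-bit Lloyd-Max reconstruction levels for a standard normal variable.
const LEVELS: [f32; 4] = [-1.5104, -0.4528, 0.4528, 1.5104];

/// Tiny xorshift64 generator so the sketch needs no external crates.
fn xorshift(state: &mut u64) -> u64 {
    *state ^= *state << 13;
    *state ^= *state >> 7;
    *state ^= *state << 17;
    *state
}

/// Random +1/-1 signs derived from a seed (the diagonal D in y = H * D * x).
fn random_signs(dim: usize, seed: u64) -> Vec<f32> {
    // Mix the seed so small seeds still set many bits; xorshift must not start at zero.
    let mut state = (seed ^ 0x9E37_79B9_7F4A_7C15).max(1);
    (0..dim)
        .map(|_| if xorshift(&mut state) >> 63 == 1 { 1.0 } else { -1.0 })
        .collect()
}

/// In-place orthonormal fast Walsh-Hadamard transform (length must be a power of two).
fn fwht(v: &mut [f32]) {
    let n = v.len();
    assert!(n.is_power_of_two());
    let mut h = 1;
    while h < n {
        for block in (0..n).step_by(2 * h) {
            for i in block..block + h {
                let (a, b) = (v[i], v[i + h]);
                v[i] = a + b;
                v[i + h] = a - b;
            }
        }
        h *= 2;
    }
    let scale = 1.0 / (n as f32).sqrt();
    v.iter_mut().for_each(|x| *x *= scale);
}

/// Forward rotation: y = H * D * x.
fn rotate(x: &[f32], signs: &[f32]) -> Vec<f32> {
    let mut y: Vec<f32> = x.iter().zip(signs).map(|(a, s)| a * s).collect();
    fwht(&mut y);
    y
}

/// Inverse rotation: x = D * H * y (the orthonormal H is its own inverse).
fn unrotate(y: &[f32], signs: &[f32]) -> Vec<f32> {
    let mut x = y.to_vec();
    fwht(&mut x);
    x.iter().zip(signs).map(|(a, s)| a * s).collect()
}

/// Quantize a unit-norm vector to one 2-bit code per rotated coordinate.
fn quantize(x: &[f32], signs: &[f32]) -> Vec<u8> {
    // Rotated coordinates have variance of about 1/d, so rescale to unit variance.
    let scale = (x.len() as f32).sqrt();
    rotate(x, signs)
        .iter()
        .map(|&y| {
            let z = y * scale;
            // Index of the nearest codebook level.
            (0..LEVELS.len())
                .min_by(|&a, &b| (z - LEVELS[a]).abs().total_cmp(&(z - LEVELS[b]).abs()))
                .unwrap() as u8
        })
        .collect()
}

/// Look up each code's level, undo the rescaling, and rotate back.
fn dequantize(codes: &[u8], signs: &[f32]) -> Vec<f32> {
    let scale = (codes.len() as f32).sqrt();
    let y: Vec<f32> = codes.iter().map(|&c| LEVELS[c as usize] / scale).collect();
    unrotate(&y, signs)
}
```

The rotation is what lets a single fixed scalar codebook work for any input: after it, each coordinate is approximately Gaussian regardless of how the original vector was distributed. Only the seed and the 2-bit codes need to be stored.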

Usage

use mnemonist_quant::{TurboQuantMse, TurboQuantProd, CompressedEmbeddingStore};
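
The import above brings the three types into scope. Their constructors and methods are not shown here, so the following is a standalone sketch of the idea behind the inner-product quantizer rather than the crate's API. It assumes a two-stage scheme: a coarse first stage (here a plain rounding grid standing in for the MSE quantizer) and a sign-bit Gaussian projection of the residual in the style of QJL. All names are hypothetical.

```rust
// Illustrative sketch only: names are hypothetical, not the mnemonist-quant API.
use std::f32::consts::PI;

/// Tiny xorshift64 generator so the sketch needs no external crates.
fn xorshift(state: &mut u64) -> u64 {
    *state ^= *state << 13;
    *state ^= *state >> 7;
    *state ^= *state << 17;
    *state
}

/// Standard normal sample via the Box-Muller transform.
fn gaussian(state: &mut u64) -> f32 {
    // Map the top 24 bits to (0, 1] so ln() never sees zero.
    let u1 = ((xorshift(state) >> 40) as f32 + 1.0) / 16_777_216.0;
    let u2 = (xorshift(state) >> 40) as f32 / 16_777_216.0;
    (-2.0 * u1.ln()).sqrt() * (2.0 * PI * u2).cos()
}

/// Gaussian projection matrix with `m` rows, reproducible from a shared seed.
fn projection(m: usize, dim: usize, seed: u64) -> Vec<Vec<f32>> {
    let mut state = (seed ^ 0x9E37_79B9_7F4A_7C15).max(1);
    (0..m)
        .map(|_| (0..dim).map(|_| gaussian(&mut state)).collect())
        .collect()
}

fn dot(a: &[f32], b: &[f32]) -> f32 {
    a.iter().zip(b).map(|(x, y)| x * y).sum()
}

/// Stand-in first stage: round each coordinate to a coarse grid.
fn coarse(x: &[f32], step: f32) -> Vec<f32> {
    x.iter().map(|v| (v / step).round() * step).collect()
}

/// What is stored for the residual: one sign bit per projection row, plus its norm.
struct ResidualSketch {
    signs: Vec<bool>,
    norm: f32,
}

fn encode_residual(r: &[f32], s: &[Vec<f32>]) -> ResidualSketch {
    ResidualSketch {
        signs: s.iter().map(|row| dot(row, r) >= 0.0).collect(),
        norm: dot(r, r).sqrt(),
    }
}

/// Estimate <q, r> from the sign bits. For Gaussian rows,
/// E[sign(s.r) * (s.q)] = sqrt(2/pi) * <q, r> / |r|, hence the sqrt(pi/2) * |r| factor.
fn estimate_residual_dot(q: &[f32], sk: &ResidualSketch, s: &[Vec<f32>]) -> f32 {
    let sum: f32 = s
        .iter()
        .zip(&sk.signs)
        .map(|(row, &pos)| if pos { dot(row, q) } else { -dot(row, q) })
        .sum();
    (PI / 2.0).sqrt() * sk.norm * sum / s.len() as f32
}

/// Full estimate: exact dot product with the coarse part plus the sign-bit correction.
fn estimate_dot(q: &[f32], x_hat: &[f32], sk: &ResidualSketch, s: &[Vec<f32>]) -> f32 {
    dot(q, x_hat) + estimate_residual_dot(q, sk, s)
}
```

The first stage alone gives a biased inner-product estimate because its rounding error correlates with the input. Adding an unbiased estimate of the residual's contribution removes that bias, at the cost of one extra bit per projection row and one stored norm.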

References

TurboQuant: Redefining AI Efficiency with Extreme Compression (Google Research blog)
TurboQuant: Online Vector Quantization with Near-Optimal Distortion Rate (arXiv:2504.19874)
PolarQuant: Quantizing KV Caches with Polar Transformation (arXiv:2502.02617)
QJL: 1-Bit Quantized JL Transform for KV Cache Quantization with Zero Overhead (arXiv:2406.03482)

License

See LICENSE in the repository root.