Expand description
Axonml Quant - Model Quantization Library
§File
crates/axonml-quant/src/lib.rs
§Author
Andrew Jewell Sr - AutomataNexus
§Updated
March 8, 2026
§Disclaimer
Use at own risk. This software is provided “as is”, without warranty of any kind, express or implied. The author and AutomataNexus shall not be held liable for any damages arising from the use of this software.
Re-exports§
pub use calibration::CalibrationData;pub use calibration::calibrate;pub use dequantize::dequantize_block;pub use dequantize::dequantize_tensor;pub use error::QuantError;pub use error::QuantResult;pub use inference::QuantizedLinear;pub use inference::QuantizedModel;pub use inference::deserialize_quantized;pub use inference::serialize_quantized;pub use quantize::quantize_model;pub use quantize::quantize_tensor;pub use types::QuantType;pub use types::QuantizedBlock;pub use types::QuantizedTensor;
Modules§
- calibration
- Calibration for Quantization
- dequantize
- Dequantization Functions
- error
- Quantization Error Types
- inference
- Quantized Inference — fast inference with quantized weights
- quantize
- Quantization Functions
- types
- Quantization Types
Constants§
- DEFAULT_
BLOCK_ SIZE - Default block size for quantization.
- MAX_
BLOCK_ SIZE - Maximum block size supported.