Crate axonml_quant

Axonml Quant - Model Quantization Library

§File

crates/axonml-quant/src/lib.rs

§Author

Andrew Jewell Sr - AutomataNexus

§Updated

March 8, 2026

§Disclaimer

Use at own risk. This software is provided “as is”, without warranty of any kind, express or implied. The author and AutomataNexus shall not be held liable for any damages arising from the use of this software.

Re-exports§

pub use calibration::CalibrationData;
pub use calibration::calibrate;
pub use dequantize::dequantize_block;
pub use dequantize::dequantize_tensor;
pub use error::QuantError;
pub use error::QuantResult;
pub use inference::QuantizedLinear;
pub use inference::QuantizedModel;
pub use inference::deserialize_quantized;
pub use inference::serialize_quantized;
pub use quantize::quantize_model;
pub use quantize::quantize_tensor;
pub use types::QuantType;
pub use types::QuantizedBlock;
pub use types::QuantizedTensor;

Modules§

calibration
Calibration for Quantization
dequantize
Dequantization Functions
error
Quantization Error Types
inference
Quantized Inference: fast inference with quantized weights
quantize
Quantization Functions
types
Quantization Types

Constants§

DEFAULT_BLOCK_SIZE
Default block size for quantization.
MAX_BLOCK_SIZE
Maximum block size supported.
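
The block-size constants suggest that quantization works on fixed-size blocks of values. The following is a minimal sketch of that general idea, assuming symmetric 8-bit quantization with one scale per block. It does not use or reproduce this crate's API: `BLOCK_SIZE`, `Block`, `quantize_blocks`, and `dequantize_blocks` are hypothetical names, and the value 32 is an illustrative choice rather than the actual value of DEFAULT_BLOCK_SIZE.

```rust
/// Hypothetical block size for this sketch only (not the crate's DEFAULT_BLOCK_SIZE).
const BLOCK_SIZE: usize = 32;

/// One quantized block: a per-block scale plus signed 8-bit values.
/// Hypothetical type, unrelated to the crate's QuantizedBlock.
#[derive(Debug, Clone, PartialEq)]
struct Block {
    scale: f32,
    values: Vec<i8>,
}

/// Split `data` into blocks of at most `block_size` values and quantize each
/// block symmetrically to the range [-127, 127].
fn quantize_blocks(data: &[f32], block_size: usize) -> Vec<Block> {
    assert!(block_size > 0, "block size must be non-zero");
    data.chunks(block_size)
        .map(|chunk| {
            // Largest magnitude in the block determines the scale.
            let max_abs = chunk.iter().fold(0.0_f32, |m, x| m.max(x.abs()));
            // An all-zero block gets scale 1.0 to avoid dividing by zero.
            let scale = if max_abs == 0.0 { 1.0 } else { max_abs / 127.0 };
            let values = chunk
                .iter()
                .map(|x| (x / scale).round().clamp(-127.0, 127.0) as i8)
                .collect();
            Block { scale, values }
        })
        .collect()
}

/// Reconstruct approximate f32 values by multiplying each stored value
/// by its block's scale.
fn dequantize_blocks(blocks: &[Block]) -> Vec<f32> {
    blocks
        .iter()
        .flat_map(|b| b.values.iter().map(move |&q| f32::from(q) * b.scale))
        .collect()
}
```

In this sketch, calling `quantize_blocks(&data, BLOCK_SIZE)` on 70 values yields three blocks (32, 32, and 6 values), and the round-trip error of each value is bounded by half of its block's scale. Smaller blocks track local value ranges more closely at the cost of storing more scales.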