oxicuda-quant 0.1.1

GPU-accelerated quantization and model compression engine for OxiCUDA
//! # Quantization Analysis Tools
//!
//! Utilities for understanding and optimising model compression.
//!
//! | Module        | Contents                                               |
//! |---------------|--------------------------------------------------------|
//! | `sensitivity` | [`LayerSensitivity`], [`SensitivityAnalyzer`]          |
//! | `metrics`     | [`CompressionMetrics`], [`ModelCompressionMetrics`]    |
//! | `policy`      | [`MixedPrecisionPolicy`] — greedy bit-width assignment |

pub mod metrics;
pub mod policy;
pub mod sensitivity;

pub use metrics::{CompressionMetrics, ModelCompressionMetrics};
pub use policy::MixedPrecisionPolicy;
pub use sensitivity::{LayerSensitivity, SensitivityAnalyzer};